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PREFACE 


This book is intended for the student of physics or chemistry who 
wishes to use quantum mechanics. Not much space, therefore, is de- 
voted to the foundations of the subject ; in particular no attempt is 
made to present the foundations in their most general form, as is done 
in the classical treatises of Dirac and of Pauli, for example. SchrOdinger’s 
wave equation for a single particle in an electrostatic field is shown to 
follow from diffraction experiments; generalizations of the equation 
for the magnetic field, the many-body problem, for the spin, and for 
relativistic velocities are introduced as and when the physical sub- 
jects under discussion make it necessary. This treatment, though it 
lacks the elegance of the more general formulation, has the advantage 
of keeping close to experimental fact. It should appeal to the research 
worker who wants to learn the subject in order to explain new facts of 
physical science, rather than to the man whose interest is in the logical 
basis of quantum mechanics, and in the improvements necessary in 
those parts that remain unsatisfactory. 

We are grateful to Dr. A. F. Devonshire and Dr. K. Guggenheimer 
who have read parts of the book in manuscript or in proof, and made 
suggestions for its improvement. 

N. F. M. 
I. N. S. 

April 1948 
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NUCl£AH FHYSirS DIVISION. 
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INTRODUCTION 


Quantum or Wave Mechanics is the name given to a system of equations 
•which must be used instead of Newton’s Laws of Motion in order to 
calculate the behaviour of atoms and electrons and other ultimate 
particles of matter. Newton’s laws have proved adequate for the 
mathematical description of a very wide class of phenomena, including, 
for instance, the motion of the planets round the sun and the motion 
of projectiles under the influence of gravity and air resistance. When, 
however, they are applied to the behaviour of electrons, and in particular 
to the motion of electrons within the atom, the answer that they give 
is not in agreement with experiment. Consequently it has been neces- 
saiy to invent a new system of equations which gives as completely as 
possible an answer which does fit the facts. These equations, while 
profoundly different from Newton’s laws, give very nearly the same 
results except when applied to particles of extremely small mass such 
as electrons ; this will be shown in Chapter I of this book. Therefore it 
must not be thought that two different sets of equations exist, one for 
atoms and electrons and one for the heavier bodies of everyday ex- 
perience ; Newton’s laws can be considered a special case of the more 
general quantalf laws. 

The first step in the development of the new system of mechanics is 
due to the German physicist Max Planck, who in 1900 made the assump- 
tion that the energy of light was quantized, in order to explain the 
spectrum of black-body radiation. J He introduced into physics a new 
constant of nature, Planck’s constant. In 1905 Albert Einstein made 
use of Planck’s assumption to account for the photoelectric effect. In 
1913 Niels Bohr, the great Danish leader of theoretical physics who was 
working at that time with Rutherford in Manchester, made use of the 
idea of quantization and of Planck’s constant to give a quantitative 
theory§ of the line spectrum of hydrogen. In this theory, the outlines 
of which are given in Chapter II of this book, Newton’s Laws of Motion 
were accepted as the correct equations for the electrons in atoms. In 
the hydrogen atom the single electron was therefore a^umed to revolve 
round the positively charged nucleus in circular or elliptic orbits. The 
new assumption was that only certain of these orbits were allowed, 

t The word *quaiital* has been introduced as an adjective to describe that which 
pertains to the quantum or to quantum mechanics. 

t M. Plank, Ver. d. Deut. Phys. QeaeUschaft, 2, 237 (1900). 

I N. Bohr, Phil. Mag. 26, 1 476, 867 (1913). 
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those for which the angular momentum was an integral multiple of 
h denoting Planck’s constant. 

In the hands of Niels Bohr in Copenhagen, Arnold Sommerfeld in 
Munich, and a number of other investigators, a fairly detailed theory 
of atomic and molecular spectra was developed along these lines ;t this 
was known as the Quantum Theory. The theory was brilliantly success- 
ful in many directions and its formulation was the work of a man of 
genius ; nevertheless it became apparent in a very few years that it, too, 
gave the wrong answer in many cases, and between 1924 and 1927 the 
completely new system of equations which we call Quantum Mechanics 
was put forward. In this system Newtonian mechanics is abandoned 
completely and the quantization of Bohr’s theory appears as a con- 
sequence of the equations. These developments were carried out under 
the guidance and inspiration of Niels Bohr, by a large number of 
investigators— in particular, de Broglie in France,}: SchrOdinger in 
Switzerland, Heisenberg, Pauli, and Bom in Germany, § and by Dirac 
in England. II The theory was originally presented in two different 
forms, the ‘Matrix Mechanics’ and the ‘Wave Mechanics’. Both 
formulations are mathematically equivalent and lead to the same 
results. In most of this book the methods of Wave Mechanics will be 
used, because they are most suitable for the applications of the theory 
to the simpler problems of physics and chemistry. 

We shall not attempt in this book to follow closely the historical 
development of the subject. Before introducing the equations of 
quantum mechanics, we shall give an outline of the experimental facts 
which have made the development of these equations necessary. We 
shall then show how these facts force us to invent new laws of motion 
quite different from Newton’s laws. These laws will then be developed 
and applied to a number of physical phenomena. 

t See, for example, N. Bohr, ‘On the quantum theory of line spectra*, D. Kgl, Dc^nake 
Vid,Selak, Skrifter^ Nat. Math, iv, 1 (1918); Vber die QwrUenIheorie der lAnimapehtrm 
(Braunschweig, 1923). 

I L. de Broglie, Ann. de Phyaiquet 3, 22 (1925). 

§ W. Heisenberg, Zeita. f. Phyaik, 33, 879 (1925) ; M. Bom and P. Jordan, ibid. 34, 
868 ( 1926) ; M. Bora, P. Jordan, and W. Heisenberg, ibid. 35, 667 (1926) ; E. Schrddinger, 
Ann. derPhyaik, 79, 361, 489, 734 (1926). 

j| P. A. M. Dirac, Proc. Boy. Soc. A, 109, 642 (1926) ; 112, 661 (1926) ; 113, 621 (1927) ; 
114, ^8, 710 (1927). 
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THE SCHRODINGER EQUATION 


1 . The wave nature of matter 

1.1. Interaction of matter and radiation 

One of the most obvious facts about light is that opaque bodies cast 
shadows, and that in casting shadows light appears to travel in straight 
lines. It was this fact which led Sir Isaac Newton to put forward his 
corpuscular theory of light, according to which energy is conveyed away 
from a source of light in the form of the kinetic energy of a stream of 
particles. The reason why the corpuscular 
tlieory was abandoned, and replaced by 
the wave theory, will be familiar to every 
reader of this book, namely that light 
shows interference. The interference is 
showji in its simplest form in Fig. 1. A 
point source of light S is separated from 
a screen CD by an opaque screen KF 
with pin-holes at A and i^, /SA being 
equal to SB. The pin-holes will both act 
as sources of light. If the nature of light 
were corpuscular, the illumination on the screen CD from the pin-holes 
A and B could be added together; thus the illumination at all points 
on the screen is greater when both holes are open than when one is. 
Since, however, light is propagated as a wave motion, this is not the 
case; the two sources reinforce each other at points P such that 
PA~PB is zero or an integral multiple of a wave-length, but interfere 
destructively if PA — PB differs from this by a half wave-length, so 
that the two sources of light spreading from A and B are out of 
phase. 

All direct ways of measuring the wave-length of light, such as the 
use of the diffraction grating, are based on the interference principle. 
This is true also of methods for measuring the wave-length of X-rays 
by crystal spectroscopy. 

Maxwell’s theory of the electromagnetic field has shown us what it 
is that oscillates in a light wave. According to this theory, it is the 
electric vector Ef defined as the force that would act on a unit positive 
electric charge at any point, and the magnetic vector H defined in an 

6072 B 



Fig. 1. Interference experiment, 
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analogous way. A plane polarized wave, moving in the z-direction and 
with its electric vector in the ^-direction, will have the form 


E^ — Ami 27r(v^-“Z/A), Ey = Eg = 0, 

= A sin 27r(v^— z/A), = o. 

Here v is the frequency and A the wave-length; if the wave is in a vacuum 
the wave velocity is c (= 3x 10'® cm./sec.), so that 

v\ = c. 


We now know that electromagnetic waves can exist with a great variety 
of wave-lengths between several kilometres and that of the hardest 
y-radiation, less than 10“'^ cm. Down to values of the order 10“'^ cn^., 
measurements have been made by methods of crystal spectroscopy, 
though for shorter wave-lengths there is no direct evidence for the 
wave-nature of the radiation. 

For radiation of the wave-lengths used in radio transmission, that is 
to say, greater than a few centimetres, analysis based on Maxwell’s 
theory appears completely satisfactory, both as regards the transmis- 
sion of the waves and their reaction on the receiving aerials. f In the 
receiving aerials currents are induced; these can be calculated in two 
ways which are mathematically equivalent; either by considering the 
electric vector E as being responsible for setting up an electromotive 
force along the aerial, or the magnetic flux threading the circuit as 
inducing a current in it. 

When we turn to radiation of shorter wave-lengths, that is to say, 
to light and to X-rays, the position of the wave theory in general and 
Maxwell’s theory in particular is less satisfactory. It appears to be 
correct when applied to problems of propagation such as reflection, 
diffraction, and refraction of radiation; in fact, as already emphasized, 
the facts of diffraction and interference compel the adoption of a wave 
theory. But it does not correctly describe what happens when light 
is observed, or more precisely when light gives up its energy to matter. 
When this happens, phenomena alimya occur which suggest that light 
consists of a stream of particles rather than a continuous train of waves. 
An appreciation of this dual nature of light is essential for an under- 
standing of quantum mechanics, and we must consider it in some detail. 

Leaving out of consideration the hum^ eye, the most important 

t Quantum effects with centimetre waves have, however, been observed, especially 
connected with nuclear spins. Cf. the report by J. B. M. Kellogg and S. Millman, Rev, « 
Mod, Phye, 18, 323 (1946). 
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ways in which light and radiations of shorter wave-length are observed 
and measured are: 

1. Photography; 

2. Photo-conductivity, or the property that light of suitable wave- 
length has of making certain solids and liquids conduct ; 

3. The photo-electric effect, or the property that light and especially 
ultra-violet light has of releasing electrons from the surfaces of 
solids or from gas molecules. 

Modern research (cf. Chap. VIII) has shown that the primary action 
of light on photographic emulsions is to make the halide grains photo- 
conductors; in discussing the quantum nature of light, therefore, it 
need not be considered separately. The photo-electric effect shows 
most clearly the properties of light which we wish to emphasize, and 
this will be discussed first. 

What happens when light falls on the surface of a metal or of some 
other conducting material is briefly as follows: if the freqiiency of the 
light is too small, there is no emission of electrons. As the frequency 
is increased, emission begins at a definite frequency vq. If the frequency 
V is greater than vq, electrons will be emitted with kinetic energies less 
than or equal to a maximum value given by 

^^ax ~ *^o)* 

h is here Planck’s constant. This equation and the experimental facts 
on which it is based are perhaps the logical basis for the definition of 
this constant. 

These facts show that, when light gives up its energy to matter by 
ejecting photo-electrons, it behaves as though it were a stream of 
particles each of energy hv. The energy hv^ will represent the minimum 
energy necessary to eject an electron from the surface. All this was 
pointed out by Einstein in 1905; the hypothesis of the quantization 
of light had already been introduced by Planck to explain the facts 
about black-body radiation. 

The phenomenon of photo-conductivity points in the same direction. 
All photo-conducting substances show a lower limit vq above which 
the frequency of the radiation must lie if the effect is to occur. On the 
basis of Einstein’s hypothesis hvQ will be the minimum energy that an 
electron must absorb in order to escape from the molecules or atoms 
of the solid into a free state in which it can move through it. The 
nature of this free state need not concern us here and will be discussed 
in a later chapter. The important thing to notice is that a lower limit 
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oi frequency exists below which the effect does not occur. The existence 
of a lower limit is common to all photo-electric and photo-chemical 
effects and in fact to every mechanism whereby light gives up energy 
to matter except that of inducing currents in conductors. Here there 
is no lower limit, as we know from the behaviour of radio waves which 
induce currents in aerials, the energy of which can be dissipated as heat 
or absorbed in a receiver. This apparent escape from the quantized 
nature of light is connected with the existence of free electrons in 
metals. 

We must examine further the nature of these light quanta, and since 
they move with the velocity c, equal to 3 x 10^** cm./sec., we must make 
use of the formulae of the Principle of Relativity in discussing their 
properties. According to this principle, the energy of a particle of mass 
m moving with velocity v is 



For small values of vjc this reduces to mc^+lmv^; the second term is 
the familiar kinetic energy, the first represents the energy that would 
be liberated if the particle were annihilated. Now, from this formula 
it is apparent at once that since a light quantum moves with velocity c 
and has finite energy, its mass must be zero. A light quantum there^ 
fore must be thought of simply as a little bundle of energy, and when 
it loses its energy it has no mass or other measurable attribute, and can 
be considered to cease to exist. 

The question arises whether a light quantum can give up part of its 
energy to matter, or whether it always gives up all or none as in the 
photo-electric effect. The effects associated with the names of Ramanf 
and of ComptonJ show that it can. We cannot discuss these fully until 
we have treated the quantization of atomic energy levels; briefiy we 
may say that under certain conditions light after being scattered by 
gas molecules may have a component with lower frequency than the 
original. In terms of the quantum hypothesis, part of the energy of 
each quantum is absorbed by each molecule. 

, Another formula from the Principle of Relativity is useful, namely 
that connecting kinetic energy {W) with momentum (p). This is 

For light quanta, for which m vanishes, this gives 

W = cp, 

t Q. y. Raman and R. S. Krishnan, Nature^ 121, 601 (1928). 

X A. H.> Compton, Phys. Bev. 21, 483 (1923); P. Debye, Phya. Zeiia. 24, 161 (1923). 
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Therefore the momentum of a light quantum is hvjo. This formula 
gives an easy way of calculating the pressure of light, since it shows 
that if light falls on a totally absorbing surface the momentum trans- 
mitted is equal to the energy absorbed divided by c. Thus 

c X pressure = energy absorbed per unit area per unit time. 

Also the above formula for the momentum of a light quantum has been 
applied to explain the Compton effect. 

A beam of light can thus be described in two ways: (1) as a beam of 
particles, each of energy W and momentum p; (2) as a wave of frequency 
V and wave-length A. Instead of the wave-length it is often convenient 
to introduce the wave-number equal to the reciprocal of A, 

K = 1/A. 

Like the momentum p, is a vector; for the wave amplitude can be 

SO the quantity K^x-^Kyy-\-K^z is scalar; that is to say, it is inde- 
pendent of the coordinate axes chosen. Thus since (x, y, z) is a vector, 
so is [K^,Ky,K,), 

Now since the same beam of light can be described alternatively by 
two vectors, p or K, the momentum or the wave-number, it must 
necessarily be assumed that 

p — const. K. (1) 

From the formulae akeady given (p = ^v/c, K = 1/A = v/c) it follows 
that the constant is h. 


We may go farther; in the principle of relativity the momentum p 
and the energy divided by c form together what is known as a 4- vector; 
thus (PxiPyfPzi ^1^) components of a vector in the space-time 
continuum. So are v/c), the four numbers specifying the 

state of a wave. If the same physical event is to have a particle and 
a wave description it follows that each component of the one vector 
must be proportional to the corresponding component of the other. 


Thus 


W — hvf p = ^K, 


where h is some constant. These are equations which we have already 
established on experimental and theoretical grounds, with h equal to 
Planck’s constant. 

This curious dual property of light must now be defined more clearly; 
if we wish to calculate the 'path of a light beam, through any series of 
BUts or gratings, we must use-the wave theory. The wave theory gives 



6 


THE SCHBtoINGER EQUATION 


§1 


US at all points of space a value for the electric and magnetic vectors 
E and H and predicts that the energy density there will be 


Sir 


( 2 ) 


But when we come to investigate how light reacts with matter we find 
that it usually gives up its energy in quanta hv, and always acts as 
though it were a stream of particles. Formula (2), in fact, gives only 
the energy density averaged over a long time in a steady beam. In 
other words, the average number of light quanta per unit volume in 
the beam is ^ E^-\-H^ 


hv Stt 

In general, whether we are dealing with a steady beam or not, we can 
say that the probability that a light quantum is at a given instant in 
the volume element dxdydz is 

1 E^+H^ 


hv Stt 


dxdydz. 


This dual nature of light was quite unexpected, and was forced on 
theoretical physics by overwhelming experimental evidence. It has 
never been ‘explained’. On the other hand, it has been found that this 
dual nature is not confined to light, but is common to electrons, atoms, 
and in fact all material particles. It has then to be accepted as a 
fundamental law of nature, on which any scheme of mechanics must 
be built up. 

Wave properties of rmterial particles 

The evidence that cathode-rays consist of particles, the electrons, 
need not be reviewed here. Our whole system^of thought about atoms 
is based on it, as is witnessed by the confidence with which such a state- 
ment is made as that the Uranium atom contains 92 extra-nuclear 
electrons. All the arguments of this book will be based on the assump- 
tion that the electron is a particle of mass m and charge e and that it can 
only be created and destroyed under rather special conditions peculiar 
to high-voltage laboratories, stellar interiors, and j8-active nuclei. 

The discovery that beams of electrons, and indeed beams of heavier 
particles, ions or molecules or neutrons, have wave properties was made 
between 1927 and 1928, by Davisson and Germerf and by G. P. Thom- 
spuj: for electrons and later by Stem and his collaborators§ for particles 

t C. Davisdon and L. H. Qermer, Phys. Rev. 30, 707 (1027). 

X G. P. Thomson, Proc. Roy. Soc. A, 117, 600 (1928). 

§ F. Knauer and 0. Stern, Zeits.f. Phyaiky 53, 786 (1020), 1. Estermann and 0. Stem , 
ibid. 61, 116 (1030). 
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SI 

of atomic mass. The wave effects — ^at any rate the most striking ones, 
were shown when electron beams were reflected or scattered by crystal- 
line substances, diffraction phenomena extremely similar to those with 
X-rays being obtained. These phenomena enabled the wave-length A 
of the waves to be determined in terms of the energy W or momentum 
of the particles; the latter can of course be determined for charged 
particles from the accelerating voltage or the deflexion in a magnetic 
field. The experiments showed that the relation was as follows: ' 

A = hjp. (3) 

This is often written x , , 

A = hfmv, 

but (5) is correct even for particles moving with relativistic velocities.! 
It will be noticed that (3) is the same relation as that which holds 
between the momentum of a light quantum and the wave-length of 
the corresponding light wave. 

The magnitude of the wave-length is of the order 10“® cm. or less for 
fast electrons, and thus of the order of magnitude of the diameter of 
any atom. 

In a famous and interesting paper published in 1924 before the 
experimental discovery of the dual nature of electrons, de BroglieJ 
argued that if such dual properties existed, (3) must be true. His 
argument was the one already put forward in § 1.1; if K, y are the wave 
number and frequency of the wave, p, W the momentum and energy 
of the particle, both form 4-vectors in the space-time continuum. 
Therefore, since all components of (K, v) must be known if (p, IT) is 
known and vice versa, we must have 

p = ^K, W = hv, 

where h is some constant. The facts already known about light quanta 
suggested that the constant would be Planck’s constant. 

We shall return in § 4 to a consideration of the equation W = hv. 
We see, then, that just as for light quanta, when we wish to calculate 
the path of a beam of electrons (at any rate through crystals), we have 
to treat the beam as a wave. The intensity of the wave will be pro- 
portional to the density of electrons in the beam; more exactly it gives 
the probability of finding an electron at some point. However, imlike 
the position with regard to light, where Maxwell’s theory of E and H 

t For the experimental evidence for this, cf. J. V. Hughes, Phil. Mag. 19 , 129 (1936) ; 
see also S. C. Curran, ibid. 24, 953 (1937). 

t L. de Broglie, Diaaertation (Masson, Paris, 1924) ; see also Phil. Mag. 47, 446 (1924) ; 
Ann. de PhyaiquCt 3, 22 (1925). 
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already existed, nothing was known about the electron waves. A theory 
had to be built up from the beginning to fit the facts. 

In the literature the wate amplitude is denoted by 0 and is called 
the wave function. As we have seen, the only way these waves can 
make their presence felt is by the relation between the wave intensity 
and the density of electrons. The simplest assumption that we can 
make is that the density of electrons is proportional to the wave 
intensity. Since we know nothing a priori about those waves, it is 
simplest to choose the units in which 0 is measured so that the density 
is eqml to the wave intensity. Thus 

= number of electrons per unit volume, 
or dxdydz = the probability that an electron is in the 

volume element dxdydz. 

The wave function i/f is a complex quantity, and may therefore be 
written- ^ =/+*?, (4) 

where / and g are real functions of position and time. The intensity 
of the wave, and hence the average number of particles per unit volume 
at any point, is taken to be equal to the square of the modulus of the 
wave function, which is written ^*0 or and defined by 
0 *^ = 1 ^ 1 ^ 

It is often a stumbling-block to the beginner in this subject that a 
physical quantity, the wave function, should be represented by a 
complex function. The reason is as follows. We know a priori nothing 
about the wave function, but we should expect, by analogy with the 
case of light waves, that the type of expression which in other wave 
systems represents the energy density would in this case give the particle 
density. But the energy density in any wave system is always given 
by the sum of the square of two independent functions whose magni- 
tudes define the state of the wave. For a light wave they are E and H 
and the energy density is For elastic waves they are 

the displacement and velocity of the medium. Thus for the waves 
associated with an electron or other material particles, it is reasonable 
to assume that the state of the wave is defined by two quantities / and g 
at any point, and it is convenient to combine them into a single func- 
tion 0 by means of equation (4). ^ 

It may be noted that Maxwell’s equations for the electromagnetic 
field in free space may be treated in the same way; the equations are 


-ccurl£=— , 


ccurl^ = 


dE 

dt* 
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and if tp is written for both equations become 

ccurl^ = i 

ct 

It will be convenient at this stage to make a further assumption 
about the form of a plane wave. A plane wave travelling, say, along 
the a^-axis has, for any type of wave motion, the form 

A sin 2TT(Kx—vt-\-€)y 

where K is the wave number and v the frequency. In a plane polarized 
light wave E and H are in phase; thus the energy density is proportional 

and fluctuates with time at any point. There is no reason to think that 
anj^ such fluctuation occurs in the wave associated with an electron; 
it would in fact be difficult to understand what physical significance 
could be ascribed to a rapid fluctuation of the probability that an 
electron would bo found at a certain point. It is therefore reasonable 
to suppose that / and g are 90° out of phase, so that 

/ = A cos 27r(Kx—vt)y 
g = ABm27T(Kx—vt)y 
and P+g^ - A2, 

which keeps a steady value independent of time. Making use of the 
complex function ^ we see that a plane wave is represented by 

We shall represent a wave going in the other direction by 

ifj = 

In the remainder of this book we shall follow the accepted convention 
and use always the complex wave function and shall not refer again 
to the real and imaginary parts, / and g. 

1.3. The connexion with Newtonian mechanics 

We have seen that when we wish to calculate the path of a beam 
of electrons or of any other material particles, we must postulate the 
existence of a train of waves ijt of wave-length hlp\ the intensity 
gives us the average number of electrons per unit volume in any part 
of the wave. The necessity for this procedure is shown by the behaviour 
► of electrons in passing through crystals, where scattering centres (the 
atoms) have diameters comparable with the wave-length. In the 
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' absence of crystal diffraction, however, Newton’s Laws of Motion have 
been entirely successful in accounting for the behaviour of beams of 
charged particles, in particular their deflexion by electric and magnetic 
fields. It would be very unsatisfactory if we had to assume that 
Newton’s laws were valid in the latter case and wave mechanics in the 
former. Actually, wave mechanics is perfectly general and includes 
Newtonian mechanics as a special case. In 
this section we shall show, for the case of 
deflexion of a beam of electrons by an electric 
field, that wave mechanics gives the same 
result as classical Newtonian mechanics so 
long as the field remains sensibly constant 
over the whole width of the beam. The case 
of magnetic fields is treated in § 8. 

The electron beam is supposed to enter the 
field at 0 in Fig. 2, where the kinetic energy 
is W. The field is supposed to be conserva- 
tive, so that at any point of space a potential 
energy function V{x,y,z} can be defined, 
equal to the work done to take the electron 
from 0 to the point (x, y, z) considered. The kinetic energy there will be 

W—V(x,y,z), 

From the wave point of view the wave-length of the de Broghe waves 
must be hip. Since at any point (Xyy,z) 

p^ = 2m(W-V)y 

it follows that the wave-length A at that point is given by 

hlyl{2m(W-^V)}. (5) 

The wave-length therefore varies continuously from point to point, 
and therefore the problem of determining the path of the beam is that 
of determining the path in a medium of continuously varying refractive 
index. This can be done quite generally, for any type of wave motion, 
as follows; 

We require at each point along the path of the beam the curvature 
of the beam due to the refraction. In Fig. 2 let AB, A'B' be lines 
drawn along the intersection of two successive wave fronts with the 
plane of the paper. They will meet at P, the centre of curvature of the 
beam. 
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If A is the wave-length at any point, say at BB\ then the wave 
length A A' on the other side of the beam may be written 




where t is the width AB oi the beam and djdn denotes differentiation 
along BA, Also denoting PB by i2, PA by R+t, we have from the 
similar triangles PBB\ PA A' 



or 


R"^ \ dn 


( 6 ) 


This formula gives the radius of curvature R of the beam in terms of 
th(* rate of change of A, or in other words, the rate of change of the 
refractive index. 

In our case, using formula (5) for the wave-length, (6) gives 


1 1 dV 

i?~2(lf-F) dn 


(7) 


But this is exactly the formula that we should obtain using Newtonian 
mechanics. This may be seen as follows: according to Newtonian 
mechanics, the curvature of the orbit of the particles in the beam can 
be obtained by equating the centrifugal force mv^lR to the force normal 
to the trajectory exerted by the field on the particles. This is equal to 
dFjdn. We have therefore 

mv^ ^ dV 
R dn ' 


But since mv^ is equal to 2(W—V), this is identical with (7). 

So long, therefore, as the field remains sensibly constant across the 
width of the beam, a calculation of the path according to the principles 
of wave mechanics will give the same result as a calculation according 
to Newtonian mechanics. 

A beam must necessarily have a width large compared with the 
wave-length. It is not difficult to see, therefore, that the condition for 
. the applicabihty of Newtonian mechanics is that the kinetic energy of 
the particles should change by only a small proportion in a distance 
comparable with a wave-length. If this condition is not fulfilled diffrac- 
• tion effects characteristic of wave motion will occur. These are discussed 
in the next sections. 
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2. The wave equation of Schrhdinger 

In this section we shall derive Schr5dinger*s equation for the wave 
function ^ associated with a particle or beam of particles, all with the 
same energy, moving in an electrostatic dr other field of force. The 
total energy of each particle will be denoted by W, and its potential 
energy in the field of force by F(aj, y, z), as in the last section. Therefore 
we have 

kinetic energy of each particle = IF— z). 

Now a beam of waves moving in either direction along the 2 -axis 
has the form ^ ^ 

and both expressions for 0 satisfy the equation 

= ( 8 ) 


Conversely, equation (8) means that ^ has the form of a wave of wave- 
number K moving in one direction or other along the 2 -axis, or else 
that ip is given by the superposition of two such waves 

It will be remembered (cf. p. 9) that the time factor is always of the 
form More generally, a wave of wave-number K moving in the 

direction given by the direction cosines (I, m, n) will have the form 

^|f — j^Q2niK(lx-imy+nz)-Z‘nivt 


Remembering that = 1, it will easily be verified that 

(9) satisfies the equation 

VV+47r2XV = 0, (10) 

where denotes the operator 


V2 = 


^ ^ ^ 

dx^ dy^ dz^' 


Conversely, the most general solution of (10) can be formed by the 
superposition of any number of waves of the type (9), with the same 
wave-length but different directions and amplitudes. Equation (10), 
therefore, when applied to a wave function, means simply that the 
wave-length is l/K, 

Now we know from direct experiment that the relation between the 
kinetic energy If— F of a particle and the wave-number K is 

= 2m(W-V)lh\ 


( 11 ) 
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If we substitute for K in equation (10) we obtain the well-known 
equation of SchrOdingerf 

= ( 12 ) 

If V is constant, or varying very slowly from place to place, this 
expresses nothing more than what we already know about the wave 
function. It is, however, a new assumption that ^ will satisfy this 
equation even when significant changes in V occur in distances com- 
parable with A, as, for example, within the atom. This assumption will 
be made; it is justified first because of its simplicity, but ultimately 
because it gives results in agreement with experiment. 

In what follows we shall write this equation in the form 


3. Motion of electrons in one dimension 

In this and in the next section we shall discuss some simple problems 
which illustrate the conditions under which Newtonian and wave 
mechanics give different results, and what the nature of the difference is. 

In particular, our analysis shows that when particles move in fields 
such that W—V changes appreciably in a distance comparable with 
a wave-length, the results differ from classical mechanics. To show 
this we consider the extreme case of a discontinuous ‘potential jump’, 
where the kinetic energy of the particles changes discontinuously. 
This may be thought of as the limiting form of a very strong field 
acting over a very short distance. We shall show that if a beam of 
particles is incident on a potential jump, some are reflected and some 
transmitted. According to Newtonian mechanics they would be all 
transmitted or all reflected, according as to whether the height of the 
jump Vq was greater or less than the kinetic energy W, 


3.1. The effect of potential jumps on the motion of an electron 
We consider first the effect of a potential jump on the motion of 
an electron, i.e. we consider the solutions of the one-dimensional 
Sohr5dinger equation when the potential function is given by 



{x < 0), 
(x > 0). 


t E. Schrodinger, Ann. der Phyeik, 79, 361 (1926). See also E. Schrodinger, Collected 
Papers on Wave Mechanics (London, 1028). 
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There are then two cases to consider according as TJ ^ TT, where W is 
the energy of. the electron. 

Case (i). < W, The differential equations governing the motion 

of the particle are then 

= 0 (X< 0 ). 

S+W = o (*>-0), 

where ^ = ^(W-V,). 


The wave function to the left of the potential step may be taken as 

ifs = (e^OLX_^jl^-iocx^^-27nvt (jj. 0), 

consisting of an incident wave of unit amplitude and a reflected 

wave of unknown amplitude A, When x > 0 we have a solution 

representing a wave transmitted through the potential jump. 

The fact that there is a finite discontinuity in the potential function 
V occurring in the Schrbdinger equation means that ip" suffers a finite 
discontinuity at a; = 0. The increase in ip' in crossing the origin is then 
€ 

lim f ip" dx = ]iv(X€{p''(—€)-\-p'^(€)} = 0, 

€-K) J 


so that pf p' are continuous at a: = 0. Thus there are two equations 

1 +^ = B, 

= pB 

* for the determination of the constants A and B. It follows immediately 

2a 


that 


A 


B== 




Now the velocity of the particles in the incident and the reflected 
beam is p(ir-ro) U' aK 




m 


BO that in an incident beam of unit amplitude the number of electrons 
crossing unit area in unit time is ah/m — Ni, The number of particles 
in the reflected beam is 
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where is the velocity of the transmitted beam. Similarly 

the number of electrons in the transmitted beam is 

m m\fi+(x) 

From these expressions it follows immediately that 
Ni = NM 

so that, as . we would expect, the total number of particles is constant.. 

This analysis shows the fundamental difference between Newtonian 
and wave mechanics. According to Newtonian mechanics, it would of 
course follow that = 1. 

Case (ii). }{> W. In this case the equation for the wave-function 


0 in the region a; > 0 is 




dx^ 




whfre _ 2m{y^—W)\ for the transmitted wave we then take the 
exponentially decreasing solution 

0 = (13) 

The boundary conditions at a; = 0 give 

l-(-A == B, ia(l—A) — —yB. 

Solving for A we find A = - — 

so that AA* = 1, 

showing that the amplitude of the reflected wave is equal to the 
amplitude of the incident wave. Thus all the electrons are reflected. 
If the theory had not predicted this it would have needed revision. 


3.2. The tunnel effect 

We have seen in the last section that the wave function of a beam 
of electrons reflected by a potential jump does not vanish at the point 
a; = 0 where, according to the Newtonian interpretation, the electrons 
should be reflected. From this point onwards, according to the formula 
(13), the wave function decays rapidly and is proportional to e~y®. , 
We shall now consider the incidence of a beam of electrons upon a 
‘potential barrier’ bounded by sharp potential jumps. We shall take 
for the potential energy of the particles 

{0 (a:<0), 

Vq (0<a;<a); 

0 {x> a), ^ 


V(x)^ 
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and shall suppose that the kinetic energy If is less than PJ. The 
Schrodin^er equation then assumes the form 

= 0 (« < 0, a; > a), (0 < a? < 

where ocfi = V(2mTf), yh = V{2m(l^— Pf)}. Then, just as before, the 
solution of the wave equation for a; < 0 will be taken to consist of an 
incident and reflected wave 


and the solution for a; > a, a transmitted wave 


ijf — J^^ioLX-^Trivi 

Within the potential barrier, for 0 < a: < a, the general solution is 

The boundary conditions are as before that ijs and tp' shall be con- 
tinuous; these conditions at a; = 0 give 

1+^ = O-bA ici(l-A) = y(C-D), 


and at a: — a 


(7ey«+De-y« = 


y{CeY^—De-y^) = 


Eliminating Ay Cy D from these equations we obtain 
5 = 2 

i 

If we define as transmission coefficienty or transparency, T, the ratio 
of the number of electrons crossing unit area per unit time in the 
incident and transmitted beams, we have 


2 cosh ya~{-i (- — - j sinh ya \ . 


\“ y/ 


T = 55* — 4^1 4cosh2)/a + |-— -j sinh^yaj. 


If ya is large both coshya and sinhyu are approximately equal to |ey“ 
and the transparency of the potential barrier becomes 


In most cases the amplitude of the wave which ‘penetrates’ the 
‘potential barrier’ between a; = 0 and x'= a will be vei^ small. Never- 
theless these arguments show that, according to wave mechanics, if a 
beam of electrons is incident on a potential barrier of the type described, 
a small proportion of the total current will penetrate through it. Thk 
prediction of wave mechanics has been termed the ‘tunnel effect*. 
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The tunnel effect has several important applications. It applies of 
course to protons and to heavier particles as well as to electrons, but 
for heavy particles the proportion that will go through potential barriers 
of atomic dimensions is negligible and the tunnel effect is unimportant. 
For protons and a-particles it is, however, of great importance in the 
theory of the nucleus and of radioactivity.f Electrons will in practice 
pass freely through potential barriers a few electron volts high and of 



Fig. 3 a. The rectangular barrier. 



thickness up to about ten atomic diameters. This is the reason why 
an electric current will flow between two metals in contact, in spite 
of the layers of oxide or grease with which a metal surface is normally 
contaminated, and the reason why electrons can pass between metallic 
electrodes and ions in solution. 

The wave equation for a triangular barrier can also be solved 

f G. Gamow, Structure of Atomic NucUi^ chap, v (Oxford, 1937). 

O 


6072 
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exactly. Consider a triangular barrier defined by the potential function 
(Fig. 36) 

'0 (x<-VJk) 

Vo+kx {-VJk<x<0) 

Vf^—kx (0<x< VJk) 

.0 (x>Volk). 

The altitude of the triangle is thus and the slope of the sides ±ii:. 
The Schrodinger equation then becomes 


= 0 

A\ 

(14a) 

ift" — (P^^2KX)ilt = 0 

1 

/A 

/A 

(146) 

= 0 

/A 

/A 

(14c) 


where a® = jS* = 2ni(V^—W)lh\ and k = mklh^. 

If we substitute f = (2<c)*(a:+)3®/2K) 
in equation (146) then it becomes 

the solution of which may be writtenf 

0 = CiAi(f)+C2Bi(f), 

where Ai(f), Bi(|) are the Airy integral and its companion function 


CD 

Ai(?) = i J cos(f«+ J 2 ») dz, 

0 

00 

Bi(a = i J {c^-»**+sin(2f+i2*)}<fo. 


Similarly the substitution 


ij = (2(c)»J-x+|-J 


t H. Jefireys and B. S. Jeffreys, Methods of Mathematical Physics^ 447 (Cambridge, 
1946). 
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reduces the equation (14 c) to the same form. We may then write for 
the wave function 

ij, = I* < — 

0=CAi(^)+DBi(f) 

EAHr,)+FBi{r,) 

If we apply the conditions that t/t and are continuous at a; = ±Po/^ 
and at a; = 0, we obtain six equations for the six constants A, By Cy 
Dy E, F. Solving these equations we obtain an expression for B in 
terms of the Airy integrals and companion functions with arguments 
(P^I2k)* and — (a2/2/c)i. If k is small in comparison with a® and j8® we 
can replace the Airy integrals by their asymptotic expansionsf 

Ai(j!) ~ ^2-lexp(— 12 »), Bi(2) ~ i2+*exp(§2*) 


to obtain finally T = BB* == exp(— 4j8®/3/c) 

for the transparency of the triangular barrier. If we introduce the 
width I of the barrier at the height of penetration, so that 

I „Fo-W 
2Fo/^ V ’ 

wehave ; = 

k K 

and the formula for the transparency becomes 

T = exp(--P). (15) 


3.3. The WKB method 
A solution of the Schrodinger equation 

appropriate to the case where W—F is everywhere positive and where 
W—V changes by a small amount in a distance comparable with the 
jvave-length ^{2w(lf— F)}*"* can be obtained using a formula due 


f H. and B. S. Jeffreys, loo. cit., p. 476. 
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originally to Jeffreys.'l' The Sohrodinger equation may be written in 

f+/(»¥ = 0. 

with f(x) = 2m(W—V)lh^ a slowly varying function of x. If we write 
the solution of this equation in the form 

tfs = expia(aj), 

X 

then in the case where f{x) is a constant a(x) is simply fh or J f*dx, 

0 

If/ is not a constant, then a satisfies the differential equation 

i0L''-0L'^+f= 0 . 

If we assume that a is also a slowly varying function of x so that 

(16) 


then we have 


X 

0^ = i J f^dx. 


For a second approximation we substitute this solution in the term ioc 
in the differential equation for a and obtain 


Extracting the square root and using the fact that/' is small m com- 
parison with / we obtain the equation 

a' = 

which may be integrated to give 

X 

a = ± J/*<^!r+4ilog/^ 

Xa 

giving ^ = (7/“lexp|±i J f^dc^t (17) 

where (7 is a constant. 

This solution is of course valid only in those regions for which the 
condition (16) is satisfied. Using the first approximation for a we see 
that this condition is equivalent to ^ 



\dx 




<1. 


t H. Jeffreys, Proc. Lond, Math, Soc. (2), 23, 428^36 (1924); Phil. Mag. 33, 461-6 
(1942). 
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According to equation (17J the total number N of electrons crossing 
unit area in unit time is 

N = = CG*f-^{2m(W-V)Y = (7(7*, 

which is a constant, as it ought to be. 

In the preceding analysis it was assumed that the function f{x) was 
positive in the range considered. If f(x) < 0 we find that the solution 
can be written in the form 

iji = (-/)-»exp(± J (-/)mA 
' a-0 ' 

Jeffreys also gives formulae for connecting up solutions at a zero of 
the function f{x)y which we may take to be at x = Xq, Suppose for 
definiteness that 

fix) <0, x> x^; fix) >0, x< Xo, 
then the connecting formulae can be written as 

(/)-»exp[±i(L+j7r)]-s-» (-/)-i[expJf±^jexp(-Jlf)], (18) 

X Xq 

where iff = J (—f}^dx, L = ^pdx, 

Xt X 

This method of approximating to the solutions of a second-order 
linear equation was first applied to Schrodinger’s equation by Wentzel,t 
Kramers, J and Brillouin§ and is known usually as the WKB method. 
Higher approximations may be obtained by writing the solution of the 
wave equation in the form 

'/' = exp(^ 

where x is a power series in (fiji). 



with coefficients Xm which are functions of x. By substituting in the 
differential equation and equating powers of h it is found that 


t G. Wentzel, ZeitaJ. Phydk, 38, 618-^ (1926). 
t H. A. Kramers, ibid. 39, 828-40 (1926). 

§ Brillouin, Comptes remits, 183, 24-6 (1926). 
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in agreement with the formula (17), and that 

x!+xi _ 61 ^'* y 

2x„ ~ 32(2m)‘(W-F)* 8(2m)‘(F-7)»’ 

and similarly for xsj ^ desired degree of accuracy. 

A rigorous treatment of this method of approximation has been 
given recently by Kemblef and Titchmarsh.J 
The condition (16) for the validity of this method is that 

1. (19) 

A further condition is that in the neighbourhood of a zero x — x^oi 
V—W, the function V\x){x—x^ shall be a good approximation to 
V-W, 

3.4. The transparency of potential harriers by W KB method 
In the theory of a-decay it is of interest to study the motion of a 
particle in a field whose potential is of the form shown in Fig. 4.§ 



From the formula (17) the solution representing a wave going from left 
to right in the region x>Xiis 


h = {5(1F-F)j'iexp i J j^(lF-F)jid* 


From the connexion formula (18) it follows that the wave function 
for the region x^^<x <x^ which connects up with at a; = x^ is 




where jfj = J (7- )7)ji dx. 

X 

t Kemble, Phys. Rev. 48, 649 (1935). 

t E. G. Titchxnarsh, Eigenfunction Expansions associated with Second-order Differential 
Equations, chap, viii (Oxford, 1946). 

§ O. Gamow, ZeUs. /. Physik, 51, 204 (1928); E.-Gondon and R. W. Gurney, Phys* 
Rev. 33, 127 (1929). 
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Writing if, = I J _ f j j^(F- W)j* dx = A-M^ 

XX Xx ^ 

we may put the expression for in the form 

The solution joining this at a; = % is 

Xt 

where 1 ^= j j^(W-F)j^ dx. 

X 

The wave function represents an incident wave 

and a reflected wave ijj/, the wave function ^3 represents the wave 
transmitted through the potential barrier. The transparency of the 
barrier measured as the ratio of the current density on either side is 

T = vMtKMt = 

If e-^ ^ e-^, as in the case of a high wide barrier, then 

[ Xi 

-2jj~(F-iF)j^d* . (20) 

Xx 

This differs slightly from a formula given by Gamow,t who calculates 
not the ratio of current densities but that of probability densities. 
The derivation given here is due to B. Jeffreys. J 
For a triangular barrier of the type considered in § 3. 2 we have from (20) 

T = exp [-2 J j“(i(+*)ji _2 J J, 

*■ -li 0 •* 

where I ^ 2 %-W)lk = ^Ik. 

Thus T = exp| = exp(-P), 

in agreement with equation ( 16 ). 

A comprehensive account of the determination of the transparency 
of a potential barrier is given in Kemble’s book.§ The application of 

, t G. Gamow, Structure of Atomic Nuclei, p. 94, equation (26) (Oxford, 1937). 
j B. Jeffreys, Proc, Camb. Phil. 80c. 38, 401 (1942). 

S E, C. Kemble, Fundamental Principles of QuarUum Mechanics (New York, 1937), 
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the results of calculations of this type to the theory of a-decay is given 
in the monograph by Gamow, but the treatment of barrier problems 
given there is unsatisfactory since the asymptotic expansions are em- 
ployed at points where they become infinite. 

The penetration of partleles into potential barriers in several dimen- 
sions has been considered by Kapur and Peierls.f 


4. Frequency and group velocity of the waves associated with 
material particles 

For any kind of wave motion there exists a definite relation between 
the frequency v and the wave-length A. For light waves in a vacuum, 
for instance, this relation is 

V — cjX, 


In general we may write v = vjX, 


where v is the ‘wave velocity’, which may be a constant, or may be 
itself a function of the frequency, as is the case for light waves in a 
dispersive medium. 

The wave velocity is the velocity with which the crest of each wave 
travels. The group velocity is the velocity of a wave pulse or of a 
wave front. For instance, if a stone is dropped into water, the front 
of the disturbance travels out with the group velocity. The time taken 
for a sound to reach our ears is equal to the distance of the source 
divided by the group velocity. All direct ways of measuring the velocity 
of light give this quantity, and it is also the speed with which radar 
pulses are propagated. 

The group and wave velocities are equal only if the wave velocity 
is independent of frequency (as for light in a vacuum). In general the 
formula for the group velocity Vq isj 


Va: 


dv ^ 2 ^^ 

5(1^)“ ~ 5a’ 


In terms of our usual notation, K = 1/A 


Since v = vK this gives 


y _ 


( 21 ) 


Formula (21) is most easily proved by considering the velocity of 


f P. L. Kapur and R. Peierls, Proc. Roy. "Soc. A, 163, 606 (1937). 
X C. A. Coulson, WaveSf p. 132 (Edinburgh, 1941). 
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the wave groups formed when two wave trains of nearly equal frequency 
are superimposed. For let these two wave trains have the ampUtudes 

A sin 2TT{Kx—vt) 
and A sin 2TT(K'x—vt). 

On adding these we obtain 


2 A sin 277 ^ ^ cos 27 t 


K-K' 


-X — 



If if, K' are nearly equal this represents a train of wave groups; each 
wave moves with the wave velocity but each group 

moves with the velocity 


K-K'* 


which, if V and v are nearly equal, tends to dvjdK. 

The same formula can be shown to give the velocity of a single wave 
pulse. 

The apphcation of the concept of group velocity to wave mechanics 
is as follows: suppose that a beam of 
particles is incident with velocity F on 
a screen, in which there is a hole which 
can be closed by a shutter. The shutter 
is closed to begin with, then opened 
for a time ^o> then closed again. 

Obviously a beam of length will 
pass through the hole and travel as a 
whole with velocity F. According to 
the concepts of wave mechanics, how- 
ever, we must describe the whole phenomenon in terms of the wave 
function 0. A continuous train of waves falls on the screen; when the 
shutter is opened a train of waves of limited length, that is to say a 
wave group or wave packet passes through, and ets usual the intensity 
of the wave is equal to the density of electrons in the beam. This wave 
group must travel with the group velocity of the waves; thus if wave 
mechanics is to give a correct description of observed phenomena, the 
group velocity of the waves must be equal to the actual velocity of 
the particles that they represent. 

Up to the present in this book, in considering steady beams of 
• particles, we have not had occasion to define the frequency of the 
waves associated with them. Now, however, we must do so. The 



Fig. 6. 
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group velocity, by equation (21), is equal to dvjdK. The velocity V of 
the particles is equal to AJP/wi. We have therefore 

^==V = hKlm. (22) 


We can integrate this equation and obtain 

V — ^hK^lm+con&t. 

If we substitute for K in terms of F, we find 

V — JwF^/A+ const. 


We thus see that, apart from an unknown constant independent 
of F, hv is equal to the kinetic energy of the particles with which the 
waves are associated. 

We have next to consider the value of this constant. It will be 
realized that a steady beam of particles moving for an indefinite time 
through a field of force must be represented by a wave of constant 
frequency. It is natural to take a point where the potential energy 
is zero and to define hv as the kinetic energy there. We see therefore 


that 


hv= W, 


(23) 


where W is the total energy of each particle. This is the formula which 
we shall use for the frequency. It must be realized, however, that the 
point where the potential energy vanishes is in fact arbitrary; for 
instance that of a particle acted on by gravity may be considered zero 
on the earth’s surface, or at an infinite distance. The total energy of 
a particle, therefore, does contain an arbitrary constant; all that has 
physical meaning is the change in the energy. 

It is rather disconcerting to find that the frequency of these waves 
also contains an arbitrary constant; it suggests that, though the equa- 
tions of wave mechanics are correct in their description of how matter 
actually behaves, these waves have not the same kind of physical 
reality as sound or electromagnetic waves. This view will be confirmed 
by the considerations of the next section. 

The derivation of equation (23) is non-relativistic. A relativistic 
derivation can be given. We know that K = pjh, where p is the 
momentum of a particle; and the relation between velocity F and 
momentum pis ^ ^ ,„FMl-FVc*). 

We therefore have 


= F = = chKj^(mh^-\-h*K^). 
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On integrating \ye obtain 

V = const. 

= c^(mV-\-p^)lh+Qomt. 

= Tf/A+ const. 

We see that as before hv is equal to the total energy W of the particle, 
the kinetic energy together with energy of the rest mass, and also an 
arbitrary constant. 

The derivation given here of the formula (23) for the frequency is 
based on experiment, or rather it shows that if the equation were not 
satisfied wave mechanics would give an incorrect description of the com- 
monest facts. It may be compared with the derivation from the principle 
of relativity given in §1,2. This derivation shows that the arbitrary 
constant should be zero. 

5. The wave equation for non-stationary phenomena 

The wave equation which we have considered already, namely 

VS‘+~(W-V)>j, = 0, (24) 

is applicable to steady beams of particles each particle having energy W ; 
the particle density does not then vary with time. In the pheno- 
mena considered in the last section does vary with the time; a more 
general wave equation is therefore required. 

This wave function must be of the first order in the time, and thus 
linear in the operator djdt. The reason is that in any form of wave 
motion, if the displacement and velocity of the medium are known, 
the further motion is determinate. In electromagnetic waves the same 
is true of E and E, We have seen that the real and imaginary parts 
of ifs are analogous to E and H, Thus if iff is given at any moment its 
future behaviour should be calculable from the wave equation. It 
follows that this equation must be linear in djdt. 

The most general form of the wave equation for a particle moving 
in a field of potential energy V will be made up by superimposing wave 
functions of the type 

<li = (26) 

where satisfies equation (24). This follows from the assumption 
•made on p. 12 about the form of the time factor, and from equation 
(23) for the frequency. The functions which are to be superimposed 
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will have different amplitudes and values of W, It will easily be seen 
that ^ , 


27riW 


8t 


0 . 


The equation linear in d/dt, from which W has been eliminated and 
which (25) satisfies, is therefore 


h difj 
277% dt 


Sirhn 




(26) 


This then is the general equationf satisfied by the wave function if/. 

We must now use this equation to show that wave mechanics will 
give the same results as classical Newtonian mechanics when the field 
of force in which the electrons move does not vary much in a distance 
comparable with the wave-length. This is almost implicit in what 
has gone before; we have shown in §1.3 that a continuous beam of 
waves under these conditions follows the same path as a beam of 
particles according to classical mechanics, and also that the velocity 
of a wave packet is the classical velocity of the particles. The following 
more formal proof may, however, be of interest. 

If if/ represents a wave group or wave packet, the quantity 

j xif/if/'^dxdydzl j if/ifj* dxdydz 

will represent the average ic-coordinate of the electrons, information 
about whose position is given by the wave function ifi. It will be 
sufficient if we show that x, y, z satisfy the classical equations of motion. 
We shall in fact prove that 

which shows that the wave packet follows the Newtonian path as 
closely as its finite size will allow. 

The proof makes use of Green’s theorem in the form 

j (AV’‘B-B^»A)dx = 0 

ijPthe integral is over all space and A and B tend to zero outside a finite 
region. 

We first multiply equation (26) by wif/* and integrate over all space; 

t For a discussion of the relation of this equation to other wave equations occurring 
in physics and of the analogy between wf ve mechanics and physical optics see Condon 
and Morse, Quantum Mechanics, p. 10 (New York, 1929); K. K. Darrow, Rev. Modt 
Phys. 6, 23 (1934). 
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and then multiply the complex conjugate equation by xifs and integrate 
over all space; we then subtract the two equations and obtain 

The integral on the right can be replaced by 

We thus obtain, since the first term vanishes by Green’s theorem, 


dx __ hi 

dt 27 t . 


^ 8x 


If we differentiate further with respect to the time we obtain 
2 wJ U< Sx^^dxdt 


27 t J \dt 8x ^ dxdt j 

hi C I ^0* ^0\ 7 


by partial integration of the second term on the right-hand side. If we 
now substitute for difjjdt and dijs^jdt from (26), we obtain for the right- 
hand side 

-is/ hf +^'^•2)*+ J 

The first term vanishes by Green’s theorem and the second gives 


We thus obtain the classical equation of motion in the required form. 

We shall also verify that the equation (26) predicts the conservation 
of matter; since is equal to the probability that the particle is 

in the volume element dr, it follows that 


Ji-ant , 

integrated over all space must be equal to the number of particles 
described by the wave function; it will be equal to unity if the waye 
function is used to describe a single particle. It should therefore be 
possible to deduce from the wave equation that 


^J|^l®dT=0. (28) 

if this were not so, the prediction of the equation would not accord 
with common experience, and it would have to be modified. 
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The proof Is as follows; the left-hand side of (28) may be written 

If we substitute for and from equation (26), we obtain 

(29) 

Integrated over all space this gives zero by Green’s theorem, which is 
what we set out to prove. 

If we integrate over a finite volume (29) is equal to 

— f (0*grad^-^grad^*)„dyS, 

477171 J 

where dS is an element of the surface bounding the volume, the inte- 
gration is over the whole surface and the suffix n denotes that the 
component of the vector within the brackets normal to the surface 
should be taken. This suggests that the vector 

-^ (^* grad ^ grad ^*) (30) 

47JW 

should denote the number of particles crossing unit area per unit time. 
Multiplied by the charge on each particle it would give the current. 

It will easily be verified that if i/t has the form of a plane wave 
.4exp(iKr), (30) is equal to .4^4*?;, where v(~ hKIm) is equal to the 
velocity of the particles. 


hi 

477171 


6. The uncertainty principle 

In 1 4 it was shown that if two plane waves of wave-number K, K' 
arefeuperimposed, a series of wave groups is formed, and that the length 
of eaph, group, from one minimum to the next, is equal to the distance 
between adjacent zeros of 


cos 277 


IK-K' v-v'\ 


This distance is equal to ll{K—K^), 


If we analyse a single wave group, the sam6 relation is valid approxi- 
mately. If a wave group of length Ax and apparent wave-number Kq is 
analysed into its Fourier components, that is to say into a series of 
monochromatic waves, the wave numbers of these waves will be found 
clustered about the mean value Zq in a range of order AK given by^ 


the relation 


AxAK 1. 


(31) 
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This can be shown most simply as follows: we may represfflit the Y^ve 
group at any instant of time by 

ijj = ( 32 ) 

the exponential function exp(— is chosen as a simple and con- 
venient function which tends to zero rapidly as x increases to values 
greater than a. Thus Aa; 2a. If we analyse this function into plane 
waves it Will take the form 

00 

^ = J A(K)e^^dK. (33) 


It may easily be verified by direct integration that if we take 
A(K) = 7r*(7exp[-(7ir-Zo)V(72], 

(32) and (33) are identical. A(K) is thus very small except in a range 


Ail given by 


AJT = 2l7ra. 


Thus AxAK = 4/7r, which is of order unity. Equation (31) is therefore 
verified. 

It will be seen, therefore, that the shorter the length of a wave group 
the greater is the spread in the values of the wave-number, and thus 
in wave-length or frequency. This is of course a well-known and 
important principle in radio reception where too selective a receiver 
distorts short pulses or musical notes of high pitch, and also in optics 
where the breadth of a spectral line, and thus the spread in frequency, 
depends on the time during which the atom emits the wave and thus 
on the length of the wave train., 

The application of these principles to quantum mechanics is as 
follows: we return to the idealized experiment shown in Fig. 6 in which 
a beanj of particles moving with velocity F falls on a screen with a\slit 
in it, which can be opened or closed by a shutter; the shutter is opened 
or closed for a short time and a cloud of particles of length Ax equal 
to VtQ passes through. The density of particles in this group is 
determined by the wave function Following the principles described 
in the last paragraph, this wave function can be analysed into plane 
waves of wave-number lying in a range AK about that of the original 
wave incident on the screen, where AK IjAx. Since a wave of wave- 
number K describes a beam of particles of velocity hKjm, it follows 
that, whereas the beam of particles falling on the screen had a well- 
defined velocity F, the velocities of those that pass through the screen 
have a spread of velocities AF defined by 


AF Aa; him; 
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or, In terms of the momentum py 

A^) Aa; h. (34) 

This relation is known as the Unoertainty Principle, and was first 
formulated by Heisenberg, f 

The principle shows that, if wave mechanics is valid, and if it is 
always possible to describe a beam of electrons by a wave function, 
then any attempt to form a beam of electrons of finite and known 
length Aa;, as for example with slits and shutters, will necessarily disturb 
the momentum p of each particle by an unknown amount, the spread 
Ap being given by (34). 

The principle, however, goes further, and leads to a formulation of 
wave mechanics more general than that considered up to this point. 
Let us suppose that at a given instant of time measurements are made 
in any way of the position and of the momentum of a particle, and that 
the accuracy of the measurements are such that its position lies between 
ic— JAx and ic+JAa;, and the momentum between p—JAp andp+JAp. 
Then the equations of either Newtonian or of wave mechanics will 
enable the future position of the particle to be predicted, with certain 
uncertainties arising from the inaccuracy of the original measurement. 
The procedure according to wave mechanics will be as follows: we must 
write down a wave-function ^ which embodies the knowledge which, 
as a result of measurements, we have obtained about the position and 
momentum of the particle. The wave function will have the form of 
a wave group of length Arc and will be made up of waves with wave 
numbers in the range Ap/^. The wave equation (26), being linear in 
the time, will enable the form of ^ at any future time to be calculated. 
\^\^dx will then give the probability that the position of the particle 
at that time lies between x and x+dx. 

If the accuracy of measurement is such that Ax ^.pr^h the wave 
function will have the form of the wave group written down in 
equation (33). If AxAp > h (as is certain to be the case in practice) 
it is possible to imagine a number of these functions superimposed, 
with, random phases and values of Kq distributed over the range AZ; 
the form of the wave function will be analogous to that of a wave pulse 
of white light. But if Ax Ap is much less than hy it is impossible to write 
down a wave function at all which represents the results of the measure- 
ment. We must conclude, assuming that wave mechanics will always 
describe correctly the motion of a particle, that it is impossible to 

t W. Heisenberg, Zeita,f. Physiky 43, 172 (1927). 
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determine simultaneously the position and the momentum of a particle 
with errors less than those given by the uncertainty relation (34). Kds 
limitation, moreover, must not be one which |s temporarily imposed 
by the clumsiness of the measuring instruments now available, but 
must lie deep in the nature of things; for were it possible to overcome 
it, the whole edifice of wave mechanics would fall to the ground, and 
a direct contradiction would arise between the deductions made from 
experiments on electron diffraction and the result of our hjrpothetical 
experiment. 

It is therefore of great interest to examine the hypothetical experi- 
ments by which we could determine position and momentum simul- 
taneously, and to show that they do in fact yield an uncertainty of 
the predicted amount. The most famous of these demonstrations is 
the ‘Gamma ray microscope’ first discussed by Heisenberg. The argu- 
ment put forward is as follows: A beam of electrons is supposed to be 
travelling along the ai-axis with known velocity V and momentum p. 
It is desired to observe an electron and to measure its position; for this 
purpose it is imagined that a microscope will be used, and since the 
utmost accuracy is required a short wave-length should be chosen. 
The position can then be determined to an accuracy given by 

^x = A//a, (36) 


where a is the aperture, A the wave-length, and / the distance from 
the electron to the lens. 

Radiation cannot be scattered by an electron without disturbing the 
electron; radiation is scattered by free 
electrons according to the rules of the 
Compton effect, according to which the 
momentum lost by the light quantum 
when scattered is transferred to the 
electron. Thus if a quantum of frequency 
V, and hence momentum hvjc is scattered 
through an angle 0, momentum 

^j;(l— cos 6)lc 

is transferred to the electron. Thus we , 
cannot observe the electron without disturbing it. Moreover, we disturb 
it by an unknown amount, since owing to the finite aperture of the 
lens, 6 is not known exactly. In Fig. 6,.^ may lie between ABC and 
^ABC\ There is thus an uncertainty a// in 6, and hence, since B 90®, of 

hvajcf 



6072 
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in the momentum transferred to the electron. Since A = cjv this may 

be written . , ... 

Up = hajjA, 

where Ap is the uncertainty in the momentum of the electron after the 
measurement has been made. We see that 

as we expect. 


7. Wave functions for the momentum of a particle 

It is implicit in what we have said already that if at a given instant 
of time a particle is described by a wave-function ^[x)y deductions can 
be made about the momentum p of the particle as well as about its 
position X, In this section we shall show in detail how this can be done. 
We shall confine ourselves to particles moving in one dimension; the 
generalization to more complex systems is straightforward. 

With regard to the spatial coordinate, we have seen that: 

(а) The probability that x lies between the values x' and x'-{-dx' is 

\tlt(x^)\^dx\ 

(б) The mean or most probable value of a; is » given by 

x = j x\ip(x)\^dx. 

We require similar information about the momentum p. We want in 
fact a function g(p) such that \g(p^)\^dp' is the probability that p lies 
between the values p' and p^+dp\ and 

P = j p\g{p)\^dp 

irf the mean value of p. 

It is clear from the considerations of the last section that g{p) will 
be equal to the appropriate Fourier component of ^(a;). We may .thus 

sett 

g{p) = A j (36) 

— 00 

The constant A will be determined by the condition that 

jm\^dp=i. 

Thi^ gives A = l/^(27r^). 

The proof is as follows. Consider the integral 
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It may be written 

J/J 

which, on integrating with respect to p, gives 

JJ ^ Y(x)il>(x')dx(h!'. 

Integrating with respect to a;' and making p^ tend to infinity, and 
making use of the equation 

00 

r sina; , 

—00 

we obtain 2'jTh J |^(a;)|2dar. 

The function exp{—ipxlh)I^J(27Th) is known as a ‘transformation 
function’, by means of which we can transform the wave function 
giving information about x into a wave function giving information 
about p. 

The mean value of the momentum p is given by 

f = jp\9{pWdp. (37) 

It is often desirable to express p in terms of the wave-function 0(a;) 
for spatial coordinates. We shall now show that 

P = ^ f ^*^dx. 

^ I J ^ dx 

The proof is as follows: 

The quantity p defined by (37) may be written 

p == J J J dxdx'dp. 

We carry out the ^-integration between the limits ±Pol writing 
pjfl == ]cq, and x—x^ = ^, this gives 

p = ^ j j dxdx'. 

After partial integration with respect to Xy we obtain 

Carry out the integration with respect Uyx', we find as before 
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This identification of the momentum p with the operator {hli)dldx 
has consequences of importance. It enables one to find the average 
value of other quantities, for instance the energy of a particle. Thus 
in classical mechanics the energy H(p,x) of a particle moving in one 
dimension is given by 


H=: 


2m 


+ V{x). 


Replacing p by we see that the mean value W of the energy is 

given by _ 

IF = J ^*H^dx 

If 0 is the solution of a Schrodinger equation for known energy IT, 


(38) is then a trivial result, obtainable by multiplying both sides of 

(39) by ^(a:) and integrating over all x. Equation (38) is applicable for 
systems described by any wave functions. 

It win be noticed that if p denotes the operator (hli)dldx, then 

{px—xp)ili = 


In some formulations of quantum mechanicst the quantum conditions 
are introduced in this way, as 


px—xp 


.(40) 


It there appears that for any two dynamical variables / and g for which 

fg-gf^o 

it is possible to measure them both simultaneously with unlimited 
accuracy. Such variables are said to ‘commute’. If, however, /gr gf, 
the two quantities cannot be determined simultaneously. 

In finding the mean value of any quantify which contains terms of 
the form. an ambiguity arises, since, if is replaced by {hli)dldx, 

pxtji ^ xpift, 

f e.g. P. A. M. Dirac, Quanhm Mechanics, 3rd ed. (Oxford, 1947). 
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In such cases we replace px by \{px+xp), and thus px^ by 




i.e. by 


More generally, a classical term 


f{x)p^ 

is replaced by 



With this convention, operators have the ‘Hermitian’ property; 'this 
is defined as follows: If ^ and ^ are any wave functions vanishing at 
infinity and L{p,x) any real function ofp and q, 

J ^*14 dx = ^ dx. (41) 

For terms of the type the proof is obvious; for terms of the type 
f(x)p we have for the difference of the two sides of (41) 




frtLA.^ 
dx^ dx 


{m]dx 


In the next section it will be shown that the Hermitian property is 
necessary if the predictions from wave mechanics are to be in accord 
with the facts. 


8. Hamiltonian equations of motion 

In the preceding sections a method has been given whereby the 
wave functions of a particle can be set up, which will give information 
about the momentum p and the positional coordinate x. For certain 
purposes it may be necessary to consider more complicated dynamical 
systems, such as rigid bodies (for certain purposes molecules can be so 
treated) or numbers of interacting particles. For such systems we shall 
make use of generalized positional coordinates ggv* ^n> where n is 
the number of degrees of freedom of the system, and Hamilton’s 
equations of motion. We require a wave function ^(?i, 3 ' 2 >—) of aU 
the coordinates, the interpretation of which will be that 

is the probability that, at the instant of time considered, the coordinate 
qi lies between and qi+dqi, q^ between and q^+dq^, and so on. 
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We require also an equation linear in the time of the type of (26), to 
determine ^ at all subsequent times if it is given initially. 

The Hamiltonian equations on which we shall base the theory are 
as follows: A ‘Hamiltonian function* (r=l,...,?i) of the 

positional coordinates and momenta is set up; the equations of 
motion are then 

— ^ / 42 ) 

dqr' dt dp/ ^ ^ 


dt ■ 


The Hamiltonian function and momenta p^. are defined as follows: 
first the kinetic energy T and potential energy V are written down as 
functions of g,., q^. The Lagrangian function is then defined by 


L=T--V 


The generalized momentum p^, is then defined by 



(43) 


and the Hamiltonian function by 

S='EPr9r-L, 

where the q^. are to be eliminated by means of (43). 

Where V does not contain the time explicitly H is equal to the energy 
of the system. Thus for a particle in a field of force 




The generalized Schrodinger equation that will replace (26) is 

= (44) 

% ot 


where p^ is to be understood for (hli)dldq^. This is suggested, first 
because it reduces to (26) for the case of a particle moving in a field, and 
also because it leads to the classical Hamiltonian equations of motion 
(42) for the centre of gravity of a wave packet. This will now be proved. 
The mean value pj, of p^, is given by 

Pr = j 

and thus 

= J (rpj+rpAw = J 

on substituting for from (44) and writing p,. = (^/»)^/^?f The 
right-hand side gives 
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The first two terms vanish by the Hermitian property (41) and we 
are left with . „ 

Vr=- 

Examples of systems for which a wave equation must be deduced 
from a Hamiltonian function are: 


(a) Motion of a system consisting of two or more pa/rticles. If their 
spatial coordinates are and a wave function 

will describe the system, and 

j ^ dy-^ dz-^ doc^ dy^ dz^y 

will give the probability that the first particle is at the point {x^y y^, zf) 
in the volume element dx-j^dy^dz^ and at the same time the second 
particle is at the point (x^yy^^z^. 


(6) Motion of an electron in a nmgnetic field. In this case there is no 
potential energy function, and so the definition of H given above cannot 
be used. Nevertheless a Hamiltonian function can be set up so that 
equations (42) are valid. This function is 


_1^ 

2m 





where A is the vector potential defined so that the magnetic field 


H 


satisfies 


H = curl A. 


That this is the correct Hamiltonian may be verified as follows: 
Hamilton’s equations give 




and 






(46) 


where the suffix r can have the three values Xy y, z and is written for 
X. Substituting in the first of these equations, we find 


Pr 


e . dAg 


5F 

eg/ 


while the second gives, on differentiating with regard to the time 

e dA, . 
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This, in vector form, may be written 

mv= -gradF+-[vHJ, 

c 

which gives the classical equation of motion. 

In most physical applications H may be considered small, and the 
Hamiltonian becomes 

^ (pl+Pl+Pt)+~ (A^PA^vPv+A,p,)-\- V, 

from which the wave equation can be written down in the usual way. 
As we saw above, (Ap)^ has to be replaced by 

^H(Agrad^)+div(A^)}, 

and if div A vanishes this gives 

^ (A grad^). 

% 

The Schrodinger equation is thus 



II 

STATIONARY STATES 

9. The old quantnm theory 

It was Niels Bohr in 1913 who first brought forward the hypothesis 
that the energy of an atom is quantized. This hypothesis means that 
the total energy of the electrons in an atom cannot have any arbitrary 
value, but must have one of a series of discrete values, of which one 
is the lowest and is thus the energy of the normal state of the atom. 
Bohr’s hypothesis was made in order to explain a series of experimental 
facts, and a theory (the old quantum theory) was built up in accord 
with many of these facts, based on a few assumptions such as the 
quantization of angular momentum. It is, of course, one of the most 
striking successes of Schrodinger’s equation that the quantization of 
energy follows from it naturally and without the introduction of special 
hypotheses. 

The experimental facts on which the quantization of energy rests are 
widespread and of very different type. We may mention first the fact, 
fundamental to crystallography and indeed to chemistry, that atoms 
and molecules seem to have each a characteristic size.f It is useful to 
think of many atoms and ions, especially those having the ‘rare gas’ 
configuration of electrons such as K+ or Cl“, as behaving much like 
elastic spheres when brought into contact with each other. This 
behaviour seems quite incompatible with the picture of the atom that 
we owe to Rutherford, that of a number of electrons moving in the 
field of a positive nucleus; the dimensions of such an atom would 
depend entirely on the speed with which the electrons were moving 
and hence on their energy, and could be very small for large negative 
values of the energy. 

A second piece of evidence of the same type is provided by the specific 
heats of monatomic gases which at constant volume approximate closely 
to This means that the whole of the thermal energy of the molecules 

of the gas consists of translational energy, and that the internal degrees 
of freedom do not make any contribution. It follows that the electrons 
in the molecules cannot normally give or receive energy on collision 
with other molecules, and this in its turn implies that the minimum 
^amount of internal energy that a molecule can take up is too large to 

t See, for inBtance, the tables of ionic radii given by V. M. Goldschmidt {Trans. Fa/ra- 
day 8oc. 258, 1929) or W. H. and W. L. Bragg, The CrysUdUne State (London, 1933). 
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be transferred in a collision at ordinaiy temperatures. This shows 
that the energy must be quantij^ed. 

Since the hypothesis was originally put forward, numerous experi- 
.. ments have been carried out in which molecules of gases have been 
bombarded by electrons and it has been shown that the energy lost 
by a colliding electron must have one of a series of discrete values, 
unless enough energy is lost to ionize the molecule. We may thus take 
it as an established experimental fact that for any gaseous atom or 
molecule of given type there exists a definite state of greatest stability 
(the 'normal state’), in which the atom will normally be found; and 
that if it absorbs energy, either from light, a colliding electron, or in 
any other way, it will absorb an amount equal to one of the terms of 
a definite series. The atom is then left in an 'excited state’. The series 
tends to a definite limit, the ionization energy, 7. Any amount of 
energy greater than I can be absorbed, an electron being then ejected 
from the atom. 


^ The energies of an atom in the normal or excited states are usually 

Zero convention that the 

total energy is zero when one electron 
is removed from the atom and is at rest. 


-w. 






The energies Wq of the normal state and 
of the excited states are then all 
negative. They are shown schematically 
in Fig. 7. The ionization energy is equal 
Fig. 7. Energy levels Gf an atom, Wq and the first excitation energy, 

say, the smallest energy that 
can be absorbed by an atom in the normal state, is equal to \ Wn—W\ 
Further strong evidence for the existence of stationary stato in 
atoms IS the fact that the optical emission and absorption spectra 
consist of Imes. The mere existence of line spectra proves that the 
energy must be quantized, smce we know that when light of frequency 
V receives or gives up its energy to matter, it does so in quaL of 
mount h. It follows that an atom can only change its ener^ by one 
of a senes of defimte values. Further it is found that in the line 

IST. '■™'‘ V., 

I'l+Vj = Vj 

that the three hues correspond to transitions between states W,, W„ C 
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From an examination of the line spectra of atoms it is in fact usually 
possible to determine the relative values of the stationary states, all 
emission lines being ascribed to transitions between a series of states 
as shown in Fig. 7. Absorption lines in a cold gas, where all atoms are 
in the ground state, correspond to transitions between the ground state 
only and the excited states. 

In the line spectrum of atomic hydrogen the frequencies of the lines 
were found experimentally to conform to a simple formula; the fre- 
quencies can be expressed as 



On the left we have the wave-number (the reciprocal of the wave- 
length); on the right n-^ and are integers and 72 is a constant, known 
as Rydberg’s constant. It is equal to 

R = 109,677 cm.-i 

From this fact we can deduce that the hydrogen atom has a series of 
energy levels, and that the energy of the nth level is given by 
= —hcRjn^ {n — 1,2,...), 

subject to the convention already made that the energy should be zero 
when an electron is just removed from the atom. 

The hydrogen atom is a particularly simple structure, consisting as 
it does of a single electron moving in the field of a heavy positively 
charged nucleus. According to Newtonian mechanics the electron ought 
to move round the nucleus in circular or elliptic orbits; the energy 
could have any value from zero to minus infinity. Niels Bohr in his 
original theory assumed that the orbits were in fact Newtonian, but 
that only a restricted series of orbits were possible. Confining ourselves 
to circular orbits, we shall now show what this limitation must be if 
the observed series of energy levels is to be obtained. 

Suppose that an electron with charge e and mass m moves round an 
infinitely heavy nucleus carrying an identical charge of opposite sign.- 
If the radius of the orbit is r and the velocity of the electron is v, then 
the electrostatic attraction between them, e^/r^, will be just balanced 
by the centrifugal force mv^jr; we thus have 

~ = mv\ (1) 

r 

The potential energy of the system is — e^/r, and the kinetic energy 
Jmv*; adding these together we see that the total energy is 

e^/r, 
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which by (1) is equal to But we know from experiment that 

this is equal to —hcRjn^t where n is an integer; it follows that 

1 __ hcR 

or that r = nh^l2JicB. (2) 

It follows that the radii of the allowed orbits are proportional to the 
square of an integer. 

Again, from (1), v = i, (3) 

\ m } n ' ' 

and multiplying (2) and (3) we obtain for the angular momentum 

Now Bohr’s assumption was that the angular momentum should be 
quantized; and the elementary derivation given here shows that, if it 
is assumed that the electron moves in a circular orbit obeying Newtonian 
mechanics, this can be deduced from known experimental facts. There 
was no a priori reason to quantize the angular momentum in preference 
to any other constant of the motion such as the energy; Bohr’s choice 
of the angular momentum could only be justified by its results. 

Bohr also pointed out that, to within the limits of experimental error 
to which these quantities were known, me^l(2mhcB)i was equal to hl2^, 
Planck’s constot had the right dimensions and had already been used 
for the quantization of light; there was, however, no reason a 'priori 
why the multiplying factor should be l/ 27 r rather than anything else. 

The assumption made by Bohr in building up a system of atomic 
mechanics w^s thus that the anguto momentum Q of an electron in 
an atom or of any rotating system should be a multiple of ^/ 27 r or 

^=^nhl27T (»= 1 , 2 , 3 ,...). 

This assumption had a series of striking successefe of which a few wiU 
be enumerated here. 


, 9.1. ^tlomn^forO^rornicmoffhen^^aboutth^ 

According to the above analysis, the value of the Rydberg constant 
for an mfinitely heayy nucleus should be ^ 


B = 2TrV/wi&®c. 




Fot electron of mass m revolving round a nucleus of mass if, it will 
easily be seen that B is given by 
B = 


-«,/(i+5). 


The change (one part in 2,000) is too small to be detectable, since 
although B is known sufficiently accurately the other constants m, c, h 
are not. However, a check on the theory can be obtained by observing 
the lin6s of ionized helium. In ionized helium a single electron moves 
round a nucleus of charge 2e and mass approximately 4if , if if is the 
mass of the proton. Thus the line spectrum of ionized helium should, 
according to the theory, be given by the formula 

v/c = 4:B^Jn^, 




Experimental values of the two Rydberg constants are 
■Rfle = 109,722, Rh = 109,677, 

enabling a value of mjM equal to 1/1843 to be obtained, in reasonable 
agreement with values obtained from other sources. 


9.2. Structure of atoms other than hydrogen 

Although the old quantum theory was not able to yield exact 
quantitative results in agreement with experiment for atoms more 
complicated than hydrogen, it was able to give an extremely valuable 
qualitative picture of atomic structure, A theory comprehending elliptic 
as well as circular orbits was built up, and the interaction between the 
electrons was taken into account in a qualitative way. Much of the 
old theory, in particular the concept of the inner K, L, and M sh^ 
in a heavy atom, has been incorporated into the new theory, but since 
the concept of orbits has been completely discarded, we shall in this 
book build up the whole theory of atomic structure on a basis of wave 
mechanics; this will be done in Chapter VI. The very great service 
rendered by the old theory must, however, be stressed. 


9,3. Botatioml energy levels of molecules 

A diatomic molecule rotating about its centre of gravity may be 
treated approximately as a rigid body of moment of inertia /, say. 
If its angular velocity is co, then Bohr’a hypothesis gives 
• lai = nh/ihr (» = 1, 2, 8,**.). 

The kinetic energy is i/w® = nW/8ir*J, 
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This formula was used with fair success to account for the rotational 
terms in the band spectra of hydrogen (cf. Chap. V) and also to account 
for the drop at low temperatures of the rotational specific heat of 
hydrOgen.f The formula has been slightly modified by the wave 
mechanical derivation (Chap. V) according to which the kinetic energy 
of rotation is n{n+l)h^l»n^I. 

9.4. Vibrational energy 

It was assumed before the introduction of wave mechanics that the 
quantized energy values of any vibrating system with one degree of 
freedom and natural frequency v would be given by the formula 

W — nhvy 

it being supposed that a light wave was just a special case of a vibrating 
system, and that the formula valid for light would be valid for other 
systems. This formula was useful in accounting for the vibrational 
terms in the band spectra of molecules, and for the specific heats of 
solids at low temperatures. It is replaced according to wave mechanics 
bytheformuk W^{n+\)hv. 


10. The concept of stationary states in wave mechanics 

It can be shown as a quite general consequence of wave mechanics 
that the energy of any particle which moves in a limited space, from 
which it cannot escape, must be quantized. Thus electrons bound in 

atoms, or atoms as a whole vibrating 
about fixed positions in a solid, will 
have quantized energy values,. while a 
free electron in an electron beam can 
have, any arbitrary energy. This will 
most easily be seen if we consider a 
very idealized case,*that of an electron 
shut up in a box with perfectly re- 
^ fleeting sides. We need consider motion 

dimeoMon only; we thus suppose that the electron moves along 
X oip X — 0 to X = d, and at these two extremities, wh^ 
the w»^ are, the wave function vanishes, the W function must then 
havetheform , . . 

^ 4 mi(-nnxja)e-^, 

t Cf. R. H. FowW. Stotiaica Mechanic. 2f«i ad. p. 82 (C^Mga. 1^).: 



X‘0 


Fig. 8. Waive functions for electrons 
in a box. 
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where n is an integer and a constant. Substituting into the 


Schrodinger equation 


(ia;2T ^2 ¥ - 


we see that W has the series of valued 

” 2ma^ 


( 4 ) 


We thus have a series of solutions 

^ 8m(7rnxla)e-^^nilft 

which represent standing waves in the box. We also see that wave 
functions only exist corresponding to electrons with energies given by 
(4). If an electron could exist in the box with any other energy, it 
would not be possible to describe it by a wave function. Thus, if wave 
mechanics is to be universaUy apphcable to the behaviour of electrons 
it Mows that the energy of an electron in a box must be quantized 


.Potential 





Fig, 9. 


It will, be noticed that the quantum-number n has the following 
physical meaning; fa the number of nodes or zeros in the wave 
lunction between the^two extremities. 

Suppose that we noi«^ consider an electron Miut up in a box bounded 
not by perfectly reflecting sides but by an electrostatic field which 
pushes the electron hack when it tries to get out. The potential energy 
0 an electron in this field fa shoim in Kg. 9. Suppose that the electron 
nM an arbitrary energy W,, The wave function in the neighbourhood 
n e points A and ^ whera' the ‘classical’ electron would try to get 
out fa as described in Cbap. I, .§3.2; in the regions where the clastioal 
wctron cannot go ^ wiU deqay eyponentiaJly, while within the box it 
osculate. If we started to draw the wavp function either end, 

, King the solutions that decay exponentially instead of iTinTOnmng 
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exponentially, the two solutions will not in general join up in the middle; 
only for a discrete series of energy values will they do so, and these 

will be the quantized values that the energy of the electron must have. 
An electron in a hydrogen atom is held in a box very much of 
this type. The potential energy of such 
an electron plotted along a line passing 
^ through the nucleus is shown in the upper 
part of Fig. 10 : an electron with the 
energy represented by the horizontal 
line AB can move freely between the 
points A, B, where it will suffer total re- 
flection. The wave function will be as 
shown below; it will oscillate in the region 



Fig. 10. Field and wave function 
for hydrogen atom. 


between A and B and die away expo- 
nentially outside. Clearly, only for a series 


of energies will such a solution of the Schrodinger equation be 


obtainable. 


10.1. Quantization with several degrees of freedom 
We return now to the motion of an electron in a box with perfectly 
reflecting walls, but now consider motion in tliree dimensions. Let the 
boundaries of the box be given by 

0 <x < a, 

0<y<b, 

0 < z < c, 


and let the wave function vanish on these boundaries. Then the 
Schrodinger equation 


has solutionsf ^ = A sin sin sin 

a a a 

for the values of the energy given by 

^ nl nl\ 

2 m + 

We see that the aUowed values of the energy are defined by three 
quantum numbers ngj ^ 3 ), and that each quantum number (minus 
one) is equal to the number of nodal planes that the wave function 
has perpendicular to the appropriate axis in space. 

t Here and in subsequent work we omit the time factor. 
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It may happen that the wave equation will have two or more solutions 
for one value of the energy. This will be the case, for example, if two 
of the sides of the box are equal. Thus if a is equal to 6, the two wave 
functions with quantum numbers 

(^ 1 , (^ 2 > ^ s ) {^1 ^ 2 ) 

are solutions of the wave equation corresponding to the same value of 
the energy. The solutions in this case are said to be ‘degenerate’. 

It is instructive to consider the energy levels of an electron in a cubic 
box of side a. The ground state (111) is non-degenerate; the next three 
(211), (121), (112) all have the same energy, so the energy level has 
a threefold degeneracy. The next level is (122), (212), (221), and the 
next (311), etc., also with threefold degeneracy. After (222) comes (123), 
etc., giving a sixfold degeneracy. 

It will be noticed that when two or more degenerate levels exist, 
and I'hen a linear combination of them 

is also a solution of the wave equation. 


10.2. Electron moving in a closed 'path 

The following idealized problem is instructive, although it does not 
refer to any actual physical phenomenon. 

A particle of mass m is supposed to move with velocity v on the 
circumference of a circle of radius a. We denote by x the distance 
measured along the circumference of the circle from some arbitrary 
point P. The wave function describing the motion of the particle will 
then be ^ _ j^Q±imvxifi^ 


But clearly this function must be single-valued in a;; on going round 
the circumference of the circle we must return to the same value of 


Thus 


mv(2iTr) 1 
h 2 tt 


must be an integer. Thus the velocity is quantized, and has values 

V = nhjmr. 

This may be written in the form 

mvr = nh. 

This shows that the angular momentum, in this particular case, is 
quantized according to Bohr’s condition (cf. §9). 

8072 D 
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11. Wave functions for the simple harmonic oscillator f 
11,1. The simple harmonic oscillator in one dimension 
One of the simplest and most important examples in the theory of 
stationary states is that of the linear oscillator. A particle of mass M 
is held to a fixed point P by a restoring force equal to —px, where x 
is the displacement from P. According to Newtonian mechanics, it 
will then vibrate about P with arbitrary amplitude and energy, and 
with frequency v given by ^ . 

Our problem is to find the energies of vibration allowed by wave 
mechanics, and the associated wave functions. We shall find that the 
energies are given by 

— {n+\)hv {n = 1,2,3,...). 

Before writing down and solving the Schrodinger equation we shall obtain 
an approximation to the energies by a simple 
method. The potential energy of the particle is 
\px“. Tliis is plotted in Fig. 11. If the energy 
of the particle is TV, it can move according to 
Newtonian mechanics between the points A 
and Bf that is to say, over a distance 2xq given 
by 

AP = 2V(2TV/p) = 2xo. 

The average kinetic energy may be taken to be 


ipx* 


Fio. 11. Potential energy 
of harmonic oscillator. 


«o 

J (,W-ipx‘)dxl j dx = fF. 


Xhus the average 'vave-length is hl^iiMW/i). If n half-waves are fitted into 
the length AB, we have 

inh/V(4MF/3) = 

This gives W = 0-94nftr, 

an answer of the correct order of magnitude. 

We shall now write down the wave equation and obtain the solution. 
The wave equation is 

If we introduce a new variable 

t Cf. E. Schrodinger, Ann. der Phyaik, 79, 489 (1920). 
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the equation becomes ^ = 0, (6) 

where A = ^Wjhv, We have to find for what values of A solutions of 
(6) exist which decay exponentially to zero as y tends to oo. 

The solution is obtained by making the substitution 


0 = 


Substituting into (6), we obtain 

If we now express / by a power series 




n=0 


we obtain the recurrence relation 


(«+2)(»i+l)4„+j = (2n+l-A)4„. (7) 

It is thus clear that the equation has two independent solutions of the 

and that the coefficient can be determined from (7) in either case. 

Two cases now arise; first, if 2?i+ 1 —A does not vanish for any integral 

value of w, then / is given by a power series which is clearly divergent. 

For large n j o 

£n±a_^f 


For large w, therefore, successive terms have the same ratio to each 
other as those of the expansion of exp( 2 y 2 )^ follows that for large a;, 
the function / tends to infinity as exp(2y^), and thus that our wave 
frmction iff tends to infinity as exp(|y2). 

If A = 2n+l, (9) 


where nia a, positive integer, the series (8) terminates and / is a poly- 
nomial. It is only in these cases that our solution ijj tends to zero as 
X tends to oo. 

Equation (9) at once gives us the allowed values of the energy 
Wn = (n+i)liv (n = 0, 1, 2,...). 

The corresponding wave functions are 

hiy) = ^oexp(-iy2), 

0i(y) = 4iyexp(~i^^a), 

^a(y) = 
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The multiplying constants are arbitrary; but it is convenient to 
choose them so that the functions are ‘normalized*, i.e. so that 
J = 1. In that case 

Ao = n-i, Ai = (4/7r)l, Ai = (l/47r)i. 

The functions fn(y) are called Hermitian pol 3 niomials, and their 
properties are discussed in a number of text-books. | is defined by 

Ux) = 


Some of the wave functions are plotted against x (our original variable) 
in Fig. 12. We plot also |^|2, which gives the probability that the 
particle will be found in any unit interval. It will be seen that for 
the higher states the particle is more likely to be found towards 
the boundaries of its vibration than at the centre, where its velocity 
is highest. 



Fig. 12. Normalized wave functions of linear oscillator. 


It is worth noting here that, according to wave mechanics, even in 
the state of lowest energy, an osciUator still has some energy, namely 

the ‘zero point’ energy ^hv. This is a result characteristic of wave 
mechanics. 


5*^“'**” Phy^k. Bd.'l 

(Kerim, 1924). Cf. also Appendix to this book (pp. 376 et aeq). ^ 
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11,2. The bounded harmonic oscillator 

In the above analysis we imposed the boundary condition that the 
wave-function ^ vanishes at infinity. If we replace this condition by 
the one that the wave function vanishes at the walls of an enclosure we 
may obtain the energy levels of a ‘bounded’ harmonic oscillator. This 
problem has been treated by several authors.f It is found that the 
effect of enclosing the oscillator is to displace its energy levels towards 
higher energies, the displacement increasing with decreasing dimensions 
of the enclosure. Thus if we write |o = where ^(a;) = 0 at 


X == ±il then 


dU ’ 


where is the energy of the nth. state. The formula for is compli- 
cated but reduces to a simple form for > (n—\), when it becomes 


( 2 - 1 )! 


and for < n—\, when it reduces to 




showing that as -> oo. 






11,3. Simple harmonic oscillator in three dimensions 
Suppose that the restoring force is —PiX for displacements in the 
a;-direction, — p^y for displacements in the ^/-direction, and so on. 
The potential energy F is given by 

V = 

The Schrddinger equation is separable; if we write 

tjj = ^i(aj)^2(2^)^3(^)> 

then ^ is a solution if 

= etc. 

and TF = W^+W^+W^. 

If ^8 frequencies for vibration in the three directions, the 

energy values are 

n 

F. C. Auluck, Proc. Nat. Inst. Sci. Inddat 7, 133, 383 (1941); 8, 147 (1942); F. C. 
Auluok and D. S. Kothari, Science and CuUurCt 6, 370 (1940) ; Proc. Comb. Phil, Soc. 41 , 
175 (1946); S. Chandraflekhar, Astrophys. J. 97, 268 (1943). 
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12. The wave mechanical treatment of the hydrogen atomf 
In this section we shaU treat the motion of an electron in a central 
field of force, and in particular in the Coulomb field of a charged positive 
nucleus. We shall first treat the nucleus as of infinite mass; the cor- 
rection for its finite mass wiU be made in Chapter V. Denoting by r 
the distance of the electron firom the nucleus, we thus write F(r) for 
the potential energy of the electron in the field of the nucleus. If the 
electron moves in the field of a positively charged nucleus of charge Ze, 

F(r) = 

The Schrodinger equation is 

V^+^(W-V)tl> = 0 . ( 10 ) 


In this section we show that, for negative values of W, solutions exist 
which tend exponentially to zero outside the atom if W has one of the 


values 






( 11 ) 


This is the same series of values that was given by Bohr’s original 
theory. Solutions bounded at infinity exist for aU positive values of 
W ; these correspond to the ionized state of the atom and will be dis- 
cussed in Chapter IX. 

The first step in the solution of equation (10) is to transform it to 
spherical polar coordinates 0, The equation transformsf into 


i JL_ ^ 

r^dr\ dr)^r^ede\ de)^rWeB<f,» 

This is solved by setting 


2m 

W 


(]F-F)0 = 


0 . 

( 12 ) 


where 0, 0 are functions to be determined. It can easily be verified 
that ijs satisfies (10) if iJ, 0, 0 satisfy the following equations 




1 d 
smddd 


t B. SdirSdinger, Ann. der PhyHk, 79, 361 (1926). 
t C. E. Weatherbum, Advanced Vect<yr Analyaie, p. 16 (London, 1944). 


(13) 

(14) 
(16) 
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where and Z(?+l) are arbitrary constants which have been written 
in this form for reasons which will appear below. 

Now O must be a single-valued function of the azimuthal angle (f>\ 
that is to say, when ^ is increased by ^ or any multiple of 27r, 0 must 
be unchanged. Thus « in (13) must be an integer, and we have for the 
general solution ^ ^ ^ cos +JB sin 

where A and B are arbitrary constants. It will be convenient to ascribe 
to u positive and negative integral values and take the general solution 
to be ^ 

The equation (14) is the well-known equation of which the solution 
is the associated Legendre polynomial Pficos 6), If I is integral and 

I > \ul 

then there exists one solution iY(cos^) which is bounded in the range 
0 < ^ < TT. For other values of I no bounded solution exists. 

Values of Pf(cos^) are (we write Pf = ij) 

Pq{cobO) = 1, 

Pi(cos^) = cos^, PJ(cos^) = sin^, 

P2(cos^) = I cos®^— 1. 

We thus see that solutions of Schrodinger’s equation can be classified 
conveniently according to the value of Z. If Z = 0 the solution is a 
function of r only, and thus spherically symmetrical. The solution is 
non-degenerate (except for certain special forms of F(r), of which the 
Coulomb form is one). The states corresponding to such solutions are 
known as s-states.f It will be shown below that the angular momentum 
corresponding to such states is zero. 

If Z = 1, the function 00 can have either of the three forms 

cos^, sin^e^’^, sinSe-^^. 

These three independent solutions necessarily correspond to the same 
energy value; therefore any combination of them would give a solution, 

for instance, sin 0 cos sin 0 sin (^, 

i.e. 2 i/r, xjr, yjr. 

Thus three independent solutions with Z = 1 have each a nodal plane 
passing through the origin. The states corresponding to I = 1 are 
called p-states. 

The letters s, p, d,/, now used to describe states with 2 = 0, 1, 2, 3 were originally 
introduced before the advent of the quantum theory to describe certain tferies of lines. 
8 stands for sharp, p for principal, d for difhise, / for fmdameptal. 
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Similarly the states corresponding I — 2 are called d-states. 

In general there are 21+1 different values of u corresponding to a 
given value of l\ the state corresponding to a given value of I shows a 
(2Z+l)-fold degeneracy. 

We turn now to the equation (15) for the radial part of the wave 
function. We shall take V(r) to be either a Coulomb field, or a field 

of the type ^ -Z(r)e^lr, 

where Z tends to unity as r tends to infinity, and to some larger value Z^ 
as r tends to zero. We shall see in Chapter VI that this is a good 
approximation to the field in a heavy atom with one valence electron, 
such as an alkah atom. 

We first substitute in (15) 

m =mir; 

we then obtain == 0. (16) 

This equation is of the same type as the Schrodinger equation for the 

motion of an electron in one dimension. 
The type of solution can be seen by 
plotting against r the quantity enclosed 
in curly brackets. If W is negative but 
I If I is small enough, it will appear as 
shown in Fig. 13. Where it is positive / 
will oscillate, where it is negative / will 
tend exponentially to zero. The quantum 
state will be defined by the number of 
nodes in the wave function; four are 
shown in the figure. They will appear 
in the wave-function 0 as nodal surfaces 
of spherical form. 

We denote the total number of nodal surfaces in the wave-function 
iji by n 1, and call n the principal quantum number, n is then equal 
to unity for the ground state, which has no nodes. Since I is the number 
of nodal surfaces passing through the origin, n-l-1 is the number 
of nodes in /(r), i.e. the number shown as 4 in Fig. 13. We would 
expect the value of the energy aasociated with a given quantum state 
to depend primarily on n. ActuaUy for the pure Coulomb field 
(F * — the analysis given below shows that the energy'is 
actually independent of I, and depends only on the principal quantum 
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number n according to (11). This is not the case for any other field; 
for the field described above the energy with Z = 0 is the lowest. The 
scheme of energy levels is shown schematicallyf in Fig. 14. 

Hydrogen Alkali 


n=2 





71=1 





Fig. 1 4. Levels for hydrogen and alkali atoms. 


We turn now to the solution of equation (10) for the particular case 
when V is equal to — Ze^/r. If we set 




the equation becomes 


where 




(17) 


For large a;, / will behave like and as we require the solution which 
behaves like e“®, we set j _ e”^^(a;) 

We obtain, substituting into (17), 

We attempt to find a solution of (18) of the form 

g = 1,**. (19) 

a-0 

For or this gives cr((T— 1) = l(l-{-l)y 


whence a = l-\~l or —h We require a solution that vanishes at the 
origin since R must remain finite; therefore we take the first solution. 
Substituting (19) into (18) we then obtoin the following recurrence 
relation for 


t In an actual atom the lowest levels will be forbidden for the valence electron by the 
exclusion principle. 
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If 5 is very large, and 2(8+1)— oc does not vanish for any integral value 

The ratio of successive terms in the series is thus the same as the ratio 
of successive terms in the expansion of e^. Thus g tends to infinity 
as and /, and hence as c®. 

If, however, 2(8+1)— ot vanishes for any integral value of 8, the series 
terminates. g(x) is thus a polynomial and f(x) tends exponentially to 
zero. The allowed energy values are therefore given by 
a ==2(8+1) (s= 1,2,3,...), 

i.e. W = 

2¥ (5+Z)2 

We see that the energies depend only on a single quantum number n 
defined by n = s+l. 

If » = 1, we must have i = 0, if w = 2, J = 0 or J = 1, and so on. 

The wave functions will be written down in a normalized form below. 

» = 1, 1 = 0, M = 0 

n=2,l=0,u=0 


>l> = —77 — /-Ffs— 

4^(2ir)\«/ \ aj ’ 


n = 2,l-l,n= - 1 , 0 , +1 
< 1 , = -} 

4^(277')\a/ a V2sinfl 

A very full account of the normalized wave function for hydrogen 
^ be obtained from a number of text-books, for example, Pauljng and 
Wilson, Introd/uction to Qmntwn Mechanic8 (New York, 1935). 

The problem of the hydrogen atom enclosed in a sphere (the proton 
being at the centre of the sphere) has been treated by Sommerfeld and 
elkerf who find that when the radius of the sphere is less than 1-836 
tnuM the raAus of the fimt Bohr orbit, the energy of the system is 
^^e, so that the electron is exerting a pressure on the waUs of the 

12.1. fwferpreto/m 0 / jf^e 

wave fimction for an electron in an atom must be interpreted 
t A. Sommerfeld aad H. Welker, 4„„. <fer 32 . 56 (1938). 
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at the point {x,y,z). It is here assumed that the wave function is 
normalized, i.e. that 

J \^x,y,z)\^dxdydz = 1. 

An equivalent interpretation is the statement that 
e\^x,y,z)\^ 

is the charge density in the atom averaged over a time long compared 
with the period of the electronic motion. 

Some further points about the wave functions are worth noting. 
The wave functions of the s-states are spherically symmetrical; it will 
be shown later that they correspond to states of zero angular momen- 
tum. They have a maximum at r = 0, which shows that the electron 
has a large probability of penetrating into the interior of the atom. 

The wave functions of the p-states all have a nodal plane through 
the nucleus and jR(r) vanishes there; this means that the electron has 
small probability of being found near the nucleus. For d-states, where 
two nodal planes pass through the nucleus and R[r) behaves like r^, 
the probability is smaller still. 

12.2. The mcymentum density functions 

We saw in Chapter I that, if the probability that an electron moving 
on a line lies between x' and x'+dx' is \^{x')\^dx\ then the probability 
that its momentum p lies between p' and p'+dp' is 

\g(p'Wdp', 

where g(p) = p J (20) 

It is clear that this result may be extended readily to the case of an 
electron moving in three-dimensional space. Thus if ^(r) is its wave 
function in a;yz-spaoe it follows by repeated application of (20) that its 
wave function in momentum space is 

x(P) = p J (21). 

where &r denotes a volume element and (p . r) denotes the scalar product 

Px^+PyV+P.z. 

By means of this formula we can evaluate the distribution of momen- 
tum in the various states of the hydrogen atom. Thus for the ground 
state 
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Thus, using the result 

we obtain for the wave function in momentum space 

, , 2 (2^W)* 

X(P) - “ (pq:^2» 

so that the momentum density function is 

The density functions for the other states can be evaluated in a similar 
way. Pauling and Podolskyf have calculated the momentum function 
for hydrogen-like atoms, but very little theoretical work has been done 
on momentum distributions in atoms. The momentum distribution of 
electrons in atoms and molecules is of importance in electron scattering 
and will be considered later (Chap. VII). 


12.3. The continuous spectrum 

Solutions of the Schrodinger equation for the hydrogen atom also 
exist for positive values of the energy W, These correspond to the 
ionized state of the atom, and behave at infinity like r”^sin(^r-l-€). 
They will be discussed further in Chapter IX. 


13. The rigid rotator 


A further problem which can be solved by elementary methods is 
that of finding the rotational energy levels of a diatomic molecule. 
We treat the molecule as a rigid rod, able to rotate about it? centre 
of gravity with moment of inertia /. Then if 6, ((> are the spherical 
polar coordinates defining its position in space, the Schrodinger 
equation is 


¥ 

21 



+_L^1 

d<j>\ 


+ 170 = 0 . 


We solve as before by setting 


and obtain 


0 = 0(e)O(0), 

® 


t L. Pauling and B. Podolsky, Phya. Eev. 34, 109 (1929). 


and 
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This has bounded solutions Pf(oos 9), if 

W j= m(l+l)l2L ^ (22) 

This differs from the corresponding solution obtained from the old 
quantum theory in that Z® is replaced by Z(Z+1). 

The bounded rigid rotator has been considered by Sommerfeld and 
Hartmaim.f 


14. Schrddinger’s equation in invariant formt 

The invariant form of the Schrodinger equation may be obtained 
from the kinetic energy by a simple process. Suppose that in terms 
of generalized coordinates (i = l,...,n) the kinetic energy T of a 
djmiamical system may be written in the form 


T = % 


then we may consider the quantities to be the metric tensor of a 
coordinate space of n dimensions. Thus in rectangular Cartesian 
coordinates the a^j then represent masses and the the inverses of 
these masses. In this space the invariant form of the Laplacian 
operator is 


Va ^ dq^ \ dq^l 


The other invariants in the wave equation can be constructed similarly. 
We can write the Schrodinger equation in the invariant form 




dt ' 


0 . 


(2S) 


14.1. Symmetrical top molecules 

If a molecule has an axis of symmetry this axis will be one of the 
principal axes of inertia. If a molecule (such as CHgCl) has a threefold 
axis the moments of inertia about any three axes rjri'y lying in 
a plane normal to the axis of symmetry are equal. Since the momental 
ellipsoid intersects this plane in an ellipse, and since no ellipse can 
have three equal diameters at angles of 120° unless it degenerates into 
a circle, the curve of intersection must be a circle and the momental 
ellipsoid an ellipsoid of revolution. Thus a molecule of the type with 
a threefold flyia behaves dynamically like a symmetrical top. Further- 
more, there are molecules of lower symmetry (or no symmetry at all) 

f A. Sommerfeld and H. Hartmann, Ann. der Phyaiky 37, 333 (1940). 
i Cf. E. Schrodinger, ibid. 79, 748 (1926). 
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in which it so happens that two of the principal moments of inertia 
are equal; they, too, behave like symmetrical tops. 

For this reason the wave equation for a symmetrical top is of interest 
in quantum mechanics. f 

To describe the motion of a symmetrical top we make use of Euler *s 
angles 6, (f>, x- ^ denotes the angle between the z-axis in space and the 
axis of symmetry Oz' of the top, (f> and x the angles between the line 
of nodes on the line of intersection of the planes xOy, x'Oy* and the lines 
Ox, Ox' respectively. If A and 0 denote the moments of inertia of the 
top about Ox' and Oz' respectively, then the kinetic energy T of 
the top is given in terms of the generalized momenta p by 

the equation 



The Schrodinger equation for this system is then, by equation (23), 


2cos^ ay . 2AW f _ 
sin^^ dxd<f> ^ 

This equation is separable if we write 


(24) 


■ = (26) 

where m and « must be chosen to be integers if ^ returns to the same 
value when <(, and are increased by integral multiples of 2n. Substi- 
tutmg from equation (26) into equation (24) we obtain 


^ + oot»— -(mcosece-%cot 0 ) 20 +o 0 = O, (26) 


n\ 


(27) 


where we have written a = ^ 

C' 

If we make the substitutions 

« == i(l+cos 0 ), 0 = lt*l’»+"l(M— l)»l»n-»li 7 ^ 
then equAtioB (26) can be brought to the form 

, 0 , (28) 
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2a = |m+m|+|TO-»|+l+(l+4<7+4w*)t, 

2i8 = |»»+to|+I»»-«|+1-(1+4(t+4»*)*, (29) 

y = |m+n|+l. 


It follows immediately firom the definitions (29) and the fact that m 
and n are in^gers that y— 1 is a positive integer and that 

“+i8-y > 0. 


Originally 6 lay in the interval 0 < < w so that 0<M<l;ifI7is 
to be finite in this interval jS must be zero or a negative integer. This 
will only be so if 




(30) 


where 


j = J{|m+re|+|m-w|}, 

as is easily seen from (29). Hence j must be zero or a positive integer. 
Ehminating a from equations (27) and (30) we have finally for the 
energy levels 


% = 


(s-j)”-]' 


(31) 


which is Dennison’s formula for the energy levels of the rotational 
motion of simple molecules. This formula can also be derived in a 
semi-classical way.f 

The wave mechanical theory of a symmetric top which in addition 
to the usual three degrees of rotational freedom has also a degree of 
torsional freedom between two of its principal parts is of interest, since 
certain molecules, the simplest of which probably are ethylene (C2H4) 
and ethane (C2H4), are thought to behave in this manner. In this case 
two angles and ^2 S'l’© required in place of the ^ above; <j)i denotes 
the angle between the line of nodes and an aj-axis fixed in the lower 
part of the top, and ^2 is similarly defined for an axis fixed in the upper 
part. The angle of twist is then ^2~<^i potential energy, which 

was zero in the case of the rigid top, is now of the form 


V = 2^1— cosm(^2“~^i)}‘ 

The energy levels for this system have been investigated by Nielsen, { 
to whose paper the reader is referred for further details. 


t See, for instance, G. Herzberg, Infra Red and Raman Spectra of Poly^Oomic Molecules 
(New York, 1946), pp. 24-6. 
t H. A. Nielsen, Phys. Rev. 40, 446 (1932). 



Ill 
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15. The orthogonal property 

Tms property is of great importance for the further development of 
wave mechanics. It will be explained first for particles moving in one 
dimension (along the a;-axis). It will be supposed that the particle is 
described by a wave-function 0(a;) satisfying the Schrodinger equation 




( 1 ) 


It will be supposed that a set of solutions exists either satisfying the 
boundary conditions that ^ tends to zero as x tends to ±cx), or that 
0 vanishes at the extremities of some range of x. (An alternative 
boundary condition which is of importance in Chapter VIII is that dijjjdx 
vanishes at the extremities of some range of rr.) Then the orthogonal 
property states that, if are solutions of (1) corresponding to 

different characteristic values of If, and W 2 , 

( 2 ) 

the integration being over the range of x considered. 

The proof is as follows: from (1) we have the Schrodinger equation 
dVl . T V / 


and therefore 




I'W 


0 . 


Multiplying the first equation by and the second by and sub- 
tracting, we have 

,s, 

If we integrate from Wj to the second term gives 




dx dx J 


The boun^ conditions for ^ are that either d or d^jdx should vanish; 
the term in square brackets therefore vanishes and it follows that 

J 02^1 da; = 0. 

If the particle moves in three dimensions, we have 
2m 
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with a corresponding equation for We obtain as before 


On integrating over the space in which the electron moves, the first 


where the variable n denotes the normal component of the vector and 
dS an element of the surface bounding the space. Since either ift or 
dilj/dn vanishes at all points on this surface, it follows that 

J/J 

If the system contains degenerate states, i.e. a number of states 

which have all the same energy, it is always possible to obtain 
linear combinations of these, 


'I'l = 

8 


which are orthogonal to each other. Thus all the wave functions of 
the system which correspond to bound states may be considered 
orthogonal to each other. 


15.1. Expansion of an arbitrary function in terms of uoave functions 
For the discussions of this section it will be convenient to consider 
only systems for which all the energy values are quantized. This will 
be the case for a particle in a box with rigid reflecting walls (Chap. II) 
or for a simple harmonic oscillator. It will not be the case for the 
hydrogen atom, unless we suppose the atom to be shut up in a large 
box with rigid reflecting walls. 

Suppose then a particle moves in a field of potential V{x, y, z) and that 
the quantized energy-values and corresponding wave-functions 
are given by ^ 

VVn+^(W;~F)^n = 0, 


where satisfies certain boundary conditions (i.e. that should tend 
to zero as r tends to infinity). Suppose that fix^y^z) is any arbitrary 
continuous function satisfying the same boundary conditions. Then it 
may be shown that/(a;,y,z) can be expanded in the form 

fi^y y, 2) = 2 J5). (4) 

n 

Fdr a proof the reader is referred to Courant and Hilbert, loc. cit. 

• The expansion is clearly a generalization of Fourier’s theorem, the 
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functions y, z) taking the place of the sines and cosines in a Fourier 
expansion. The coefficients can be found m the same way as Founer 
coefficients. Using the orthogonal relation 

J = 0 (n ^ n') 

and assuming the wave functions to be normalized, so that 

J 1> 

then if both sides of (4) are multiplied by and an integration is 
carried out over all space, we find 

The expansion (4) has an important physical interpretation. Sup- 
posing f(x, y, z) is a wave function describing the state of a particle at 
a given instant of time. Then if the expansion (4) is made, the coeffi- 
cients wiil have the following physical significance; is the 
probabiKty that if the energy of the particle were determined, it would 
be equal to 1^^. 

The assumption that the coefficients ^4^ can be interpreted in this 
way is a new one, and is not inherent in the arguments developed up 
to this point. It is similar to the assumption made in Chapter I, that 
the Fourier component of a wave function gives information about the 
momentum of a particle. 

It will easily be verified that 


16. Transformation functions 


In the last section we showed that if a particle is described by a wave- 
function f(x), then P 


gives the probability that particle has energy In section 7 it was 
shown that 


|p/' 


-ipxlhfdx 


dp 


gives the probability that the momentum lies between p and p+dp* 
The functions 


are known as 'transformation functions’; they can be used to obtain 
information about the energy or the momentum from a wave functitm 
giving information about the position of a particle. In this section this 
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procedure will be generalized, and a method will be given whereby 
information about any dynamical variable can be obtained. 

Let the dynamical variable under consideration be denoted by 
L(p,a;), a function of the momentum and spatial coordinates. Besides 
the momentum and energy the most important variable 

that we shall have to consider is the angular momentum 


yPz-^Py^ ^Px-^Pzy ^Py~yPx- 
In this section, however, L will be assumed to be a quite general 
function of p and x. 

If, as in §7, we replace p by the operator {hji)dldx, then L can be 
regarded as an operator. We may thus set up a differential equation 

which, if giX^) is subject to appropriate boundary conditions, will define 
a set of characteristic functions gj^ which satisfy the boundary condi- 
tions if and only if L' has one of a series of characteristic values. For 
example, if L is the energy 

L — p^l2m+Vf 

(5) is the usual Schrodinger equation. If L is the momentum p, the 
equation takes the form ^ ^ 

with solutions g = 


It will now be shown that the functions gLix)^ the characteristic , 
solutions of (5), are the required transformation functions; in other 
words that, if a particle is in a state described by a wave function ^(a;). 


I J dx^ 

is the probability that the variable L has the value L\ 

First of all, let us consider g^^x) as a wave function. Then, if a 
particle is in a state described by a wave function g^^x), the average 
value of L is ^ \ 

j ^'{x)Li^—,xjgL-i^)dx. 


If the wave functions are normalized, this by (5) is equal to L'. Further, 
making use of (5) we can easily show that 


J gf,.(x)[L-L']%.{x)dx = 0, 


where L as before denotes the operator L 



It follows that if 
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the particle is in a state described by gt'(ir), then the value of L is 
certainly U. The functions Qjj stand in the same relation to the variable 
L as the wave-functions ^Jx) do to the energy 
Suppose now that the particle is in a state described by any wave- 
function ^(x). We can, as in the last section, expand ^ in a series 

As before, will give the probability that the variable L has the 
value U, But, by the orthogonal property of the functions gjjy 

= / >li{x)gl{x)dx. 

This shows that the functions gi,-{x) are the required transformation 
functions. 

16.1. Formulation of probletns in momentum space 

A useful method of solution of quantum mechanical problems consists 
in a transformation of the Schrodinger equation to momentum space. 
This method has particular advantages when applied to the solution 
of problems in theoretical nuclear physics.f The transformation of the 
Schrodinger equation to momentum space is equivalent to applying 
the method of Fourier transforms to the solution of the equation. In 
this section an outline of the method will be given; for a rigorous 
mathematical theory of the representation of the Schrodinger equation 
in momentum space reference should be made to a paper by Gordon.f 
For simpKcity we shall consider a system with only one degree of 
freedom, the extension to higher degrees of freedom being purely 
formal. If we denote the coordinate and momentum of the system by 
X, p respectively, the Schrodinger equation may be written in the form 

According to the theory we have developed above the momentum p is ' 
interpreted as a differential operator (fili)dl8x and the potential energy 
F(a:) acte on the wave function ^ as an ordinary multipHer. It is easily 
shown by integration by parts that 

— CO 

Svarthota, Tke Birring Ener^ of tko Atomic Nuclei (thesis.. Luad. 

t W. Gordon", Skand. Matem. Kongr. Stockholm. 249 (1934). 
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SO that when the problem is transposed from the ordinary space x to 
the momentum space p by means of the transformation 

00 

X(2>) = ^ J 

— 00 

we may write the Schrodinger equation in the form 

— 00 


By the Fourier inversion theorem 


ao 


SO that 


J X(p')dp' J dx. 
— 00 — 00 — 00 

00 

Writing ^(p~p') ^ \ j V{x)e^-‘^'>^l^dXy 

— 00 
00 

wo obtain ^^-W^xiP)+ J ^ip-p')xip')dp' = 0. 


The solutions of this equation differ according as W is positive or 
negative. If W is negative and we write 

,f>{p) = {p^-2mW)ix{p), 

K{p,p') — '-2m{p^—2mW)-^p'^—2mW)-h(p—p'); 

we then obtain the homogeneous integral equation 

00 

iip) = / Kip,p')<l>{p')dp’. 

— 00 

Approximate solutions of this integral equation may be constructed by 
means of the Gauss-Hilbert variational principle and the method of 
iterated functions due to Kellog.f 

In certain cases, however, it is possible to obtain exact solutions of 
this equation. For example if 

Vix) = 


t O. D. Kellog, Math. Ann. 86, 14 (1922). 
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then I z= — --5 — 

J ^ ' ci^+(p-p7' 

— 00 

so that if we write = —2mW, X = —2Vomla^ the integral equation 
for the wave-function x(p) reduces to 

00 

(P^+Po)W=-^ I ^q:(^^x(p')cip'. 

— 00 

If we assume a solution of this equation of the form 

- . v(b) = V l-.i 

then by direct substitution and use of the result 

i f P' fy' ^ _^P 

W J a‘+(p-pyp'!‘+y^ pl^(a-fy)2 

— 00 

we obtain the relation 


T c o[ 1 - _ >„2 V /. P 

^0 ”^1- I’'+(««+!po)J 2 ’‘''p‘+(an+pl)' 

Both sides of this equation represent the same function if the coefficients 
c„ satisfy the relations 

c — — 

00 

= 0. 

n-0 

Substituting from the former of these equations into the latter 
obtain the relation 


we 


00 

®o+®o 2 


n! 


(H2po/a)...(»4-2po/o) 


= 0 . 


which may be written in the form 


A-^*'“^2p.;a(2VA) = 0, 

JJz) denoting the Bessel function of order n. In this transcendental 
equation A and a are known, so the equation may be used to find p., 
ne. the possible values of the energy, If, of the system. Once p„ has 
been determmed in this way the wave function ^(p) can be obtained 
as an mmute series. 



THE ELEMENTS OF PERTUBBATION THEORY 


71 


§17 

17. The elements of perturbation theory 

17.1. Perturbation thexrry for non-degenerate states 

In this section, following the considerations of § 8, we shall write the 
symbol H for the operator 

5 = - |^V2+F. 


If the solutions of a Schrodinger equation 

= 0 (6) 

are known, together with the corresponding discrete energy values 
then it is possible to calculate the change in the energy values that 
results if H is changed by a small amount U. The term U is called the 
perturbation, and the analysis by which this is done is known as 
perturbation theory. Examples are: the change in the energy of the 
stationary states due to an electric or magnetic field, the interaction 
between the electrons in a two-electron atom, and so on. 

The first-order perturbation may be obtained as follows: let and 
be the wave function and energy value of a non-degenerate solution 
of equation (6), The perturbed equation is 


Let us write 


{H+U-W)ilf = 0. (7) 

W = W,+w, 

^ = 0n+/* 

Then retaining in (7) only small quantities of the first order in U, Wy 
and/ we have (H-WJf+(U-w)^„ = 0. (8) 

Now let us expand / in the form 
Equation (8) becomes 

n' 

If we multiply by and integrate over all spatial coordinates q we 
obtain r ^ 

= J (9) 

which is the required result. The coefi&oient in the perturbed wave 
function can be obtained in the same way: 






W —W. 


( 10 ) 


As an example, we take an electron moving in a central field with 
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potential energy function Ze% and try the effect on the ground state 
of changing to Z+1. Then 

U= -e’^lr 
00 » 00 

and w = j dr / ( dr, 

0 

where 

On evaluating this gives 
which may be compared with the exact value 

It will be noticed that formula (9) predicts a big change in the energy 
if is large in regions where the perturbing field is large. Thus, for 
example, a change in the field near the nucleus of an atom will lead 
to a much larger change in the energy of the s-states than the j)-states, 
for which ift vanishes at the nucleus. 


r / J 

a = 

mZe^ 


w = 




17.2. Matrix elements 
The quantity defined by 

is termed the ‘matrix element’ of U with respect to the two states n\ 
n". It occurs in (10) for the first time in this book. 

From any dynamical variable L(jp,q) and any set of transformation 
functions gjq) a matrix may be formed, of which the element n, n’ is 




For the Hermitian operators which occur in quantum mechanics, it 
follows that „ complex conjugate of (n'\U\n), 

and also that the diagonal elements are real. 


17.3. Perturbaiion theory for degenerate states 
Let us suppose that the unperturbed system satisfying equation (6) 
has states 0^, ^ with the same energy. Then the perturbation will in 
general split the degenerate energy value into two with energies 
If +M ?2 and wave-functions where 

02 = -^2 0 o +*® 20 ^ 


( 11 ) ‘ 
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In practice there are two ways of finding the coefficients B^, etc., 
and the energies w^. As before, if we insert into the wave equation 

we have (U-w)^i+(B-W)/i = 0. " 

Since are orthogonal, and J dq vanishes, 

= (12) 

Thus, if we can choose i/tj, by inspection such that orthogonal 

and (12) is satisfied, then the energies can be obtained from 

»>i= f Wg = j ifi^U^^dq. 

Alternatively, inserting (11) into (8), multiplying by integrating 

over all g we find U^ = 0. 


Similarly Uf,a+B^(Uf,f,—w) = 0. 

Eliminating A^, B^ and remembering that we obtain the 

following quadratic equation for w^: 

(13) 


18. The Rayleigh-Schrodinger method 
18.1, Unperturbed system non-degenerate 

In the preceding analysis we retained only small quantities of the 
first order in [7, w, /, In some problems, however, it happens that the 
matrix element 


(«|Cri«) = 


vanishes, so it is necessary to examine terms of higher order. Suppose, 
for instance, that it is possible to expand the perturbation U in the 

form Cl = A£fi+A*£Ej+A»H,+ ..., (14) 

where A is a parameter, and that are the wave function and 

energy value of a ncm-Segenerate solution of equation (6); then we may 
assume that the solution of equation (7) may be written in the form 

0=>n+A^n+A^Xn+A«0n+-. (1^) 

with corresponding energy value 

, = W;+A«;^+AV^+.... (16) 

Substituting from equations (14), (15), and (16) into equation (7) and 
rearranging in powers of A we obtain 

+A*[(J?,-vr„)^„+(£r,-u>J^„+(ff-FJx„]+- = 0. 
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The term devoid of A is identically equal to zero by equation (6). 
Equating to zero the coefficients of A we obtain the set of equations 

= 0 (17) 

= 0, (18) 

If we expand in the form 


then 


i 


multiplying both sides of this equation by iji*, integrating, and making 
use of the equation . 

we can obtain the result J dr = 0, so that, from equation 

(17). , 

\K(Hy-w,)^.^dr = Q. 

From this it follows that . w„ = (« l/fj |n). 

Furthermore, 

./ ^ am f >f>j*(H-W„)^idr 

= a„j(Wi-W^) 

~ ~ J dr 

= -(min), 


(19) 


giving 

and we have 




. (») 

J ) 

the daeh on the summation sign denoting thai the term j = » is 
omitted. J I" io 

Substituting from equations (19) afrd:(20) into equation (18), we 
obtam the equation i \ wo 

(^*~”^")'^”'*’2 (”■ 1^1 l «)} 0 J + ( Il - Wn)Xn = 0 - ( 21 ) 

If ..write 

then it can easily be shown that 

lrn(a-w,)x„dr^Q, 
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so that, multiplying equation (21) throughout by and integrating, we 

- ^'Umn){n\H,\j) ^ 22 ) 


[plying both sides of (21) bj 


Similarly, multiplying both sides of (21) by tftf (I n), we obtain 

(l\H,\n) , V' (231 

Generally only these earlier terms of the expansion are used in actual 
calculations; the higher terms become increasingly complicated. The 
final values for the perturbed wave function and energy value are thus 


W —W 

j ^^3 




and 

If = W,+X(n\Hi\n)+x4(n\H,\n)+ <^il^^®J!L)j+0{A=>). 

\ y ffn I (25) 

In cases for which the elementary method fails, i.e. («|l^|n) = 0, we 

A(«.lSi|w)+A*(n|ifjln)+... = 0, 

SO that the formula for W reduces to 


IT - ir.+A* 2' 


-"i+r 


(n\U\i)(j\V\n) 


W„- 


-1^. 


(26) 


This method of obtaining an expression for the energy of the per- 
turbed system was devised by Schr6dinger;t it is the nature^ extension 
to wave mechanics of Rayleigh’s method J of approximating to the 
modes of vibration of a contmtious solid. The procedure adopted above 
is also analogous to the methods of Newcomb and Linstedt§ for the 
solution of the equations of dynamics in terms of trigonometric series. 


18.2. Degenerate systems 

When the unperturbed syster^i &^degenerate the procedure is the 
same in principle. Suppose, for instance, that there are k independent 

■ t E. Schrodinger, Ann. der Phy»ik, 80, 437 (1926). 
t Lord Rayleigh, The Theory of Sounds vol. i, p. 113 (1894). 

§ H. Poincare, Mithodes nouveUea de la m4caniqm cdleate. 
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wave functions of equation (6) corresponding to the energy value W^. 
Denote these wave functions by 



and suppose that they are normalized and orthogonal to each other and 
to all other eigenfunctions. Then the set of k wave functions 

(27) 

i='i 

(m— 1,2,...,^) correspond to the energy and satisfy the ortho- 
gonality and normalization property 

J ~ 

provided the coefficients are chosen to satisfy the relation 

k 

J=1 

If we now assume a solution of equation (7) in the form 

then as before wo have, on equating the coefficient of A to zero, 

= 0. (29) 

If we write as a series of the wave functions, say 

^n,m “ 2 (30) 

and replace by the expansion (27), then we obtain the equation 

i = 0 (i== 1,2,...,^) (31) 

3^1 

by a procedure similar to that employed in the non-degenerate case. 

In equation (31) (i\Hi\j) now denotes the matrix element 

(W) = / 

The set of equations (31) determines the constants To determine 
the k values of the energy into which is split by the perturbation 
we eliminate the from equation (31) to obtain the determinantal 
equation 

(l|^fi|2) . . . (ll^ilit) =0. 

(215i|l) . . . (2\H,\k) 

(felfijll) (fclfl’ 212 ) . . . (k\Hi\h)—w^ (82) 

The determinaat on the left-hand ade of this equation has, of couise, . 
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h rows and h columns. In general the equation will have h distinct 
roots. Corresponding to each of these roots there will be a set of 
coefficients determined from the equations (31). These coefficients 
will, in turn define a unique wave-function defined by equation 
(27). The perturbation will then, in general, split up the energy 
into k separate energy levels W^+Wj^ (m = 1, 2,..., k) and will determine 
a unique set of wave-functions Once these are known the 
coefficients in the expansion (30) for the wave function can 
be determined as in the non-degenerate case. 


18.3. A modified form of the Schrodinger theory 
There are many problems in quantum mechanics in which the 
solutions of the unperturbed Schrodinger equation for the higher values 
of W are not known with any accuracy. We shall see later (§§ 25, 33) 
that the wave functions of the helium atom and the hydrogen molecule 
are known accurately for only the very lowest energy states. If, how- 
ever, we wished to determine the wave function of such a system 
perturbed from its normal state by means of equation (24) above, we 
would require to know all the eigenfunctions, not only the one which 
is perturbed. It is, however, possible to increase the accuracy of such 
calculations by transforming the series on the right-hand side of equa- 
tion (24) to a form which is more rapidly convergent. 

We may write 

= I mim = r 

, j j 

BO that 


\n) 


^y umn) . 

t ^n-wr 

Thus equation (20) can be put in the form 

■P" +Zw,{W^-W/^- 

Substituting from equations (19) and (33) into equation (18) we obtain 

+(H-w;)x« = o. 


(33) 
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Multiplying both sides 6t this equation by iji* and making use of the 
result r 

J = WHl\n)-(n\Hi\n)^ 

we obtain finally 

(34)‘ 


TDT, 




K 7 

The integral (w|-^^f|n) ^s as easy to evaluate as (njl/iln) and the series 
usually converges much more rapidly than those involved in (25) and 
{26). In the case in which {n\U\n) = 0 we get for the energy of the 
perturbed systto 


Tr = }r„+M^)_ 




Wn(W,-Wj) 


(36) 


18i4. Lennard-J ones’ s form of perturbation theory 
A method of obtaining the results of the Schrodinger perturbation 
theory which has the advantage of indicating exactly what is neglected 
in the above analysis and also of dispensing with the assumption that 
the perturbation is small has been devised by Lennard-Jones.f 
If we assume -that the solutions of equation (6) corresponding to 
an energy-value T^are non-degenerate, then we may write the solution 
of equation (7) in the form 

= (36) 

then = {j\U\j)^^. ( 37 ) 

Substituting from equations (36) and (37) into equation (7) we have 
. I W),l,i+ ^ ai(i\U\j)^, = 0, 
abS equating coefficients of ifif we obtain 

" - , ^iW-W^) = ^a^(j\0\i), ( 38 ) 

f <6 =f 1. 2....), the series on the right-hand side having an infinite number 
f equations (38) determine the coefficients o, and 

, of To determine the possible values of W we 

® *** equations to obtain the detenni- 

|“iPS|equatioja 

^ ' 1/11 TTJi V 


’ ■ ■ ' * V 

^3-, E. IfflUard-Jones, Pr^. gay. Soe. A, 129, 608 (1980), 


•(89) 
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the determinant on the left-hand side havfhg an infinite number of 
rows and columns. The values of WJ, WJ,..., and the wave^functions 
^ 2 ,.,. being known, the matrix elements {i\U\j) may be calculated by 
quadratures. When equation (39) has been solved for W the wave 
function may be obtained from equation (36), after the coefficients \ 
have been found from (38) or from the expansion of a determinant of 
the type 



^1 

02 

0i • • 

0= 

(2|C^|1) 

(2\V\2)-{W-W^) . 

(2|£^|») . . . 


(i|i7|l) 

(i\V\2) ^ 

• . . . 


the values of W already found being inserted. 

For definiteness, suppose we ydsh to determine the perturbation of 
W^. Then as a first approximation to the solution of equation (39) we 
may neglect all the matrix elements {i\U\j) except (11^|1) and obtain 

in agreement with equation (19). To obtain a second approximation 
we neglect all those matrix elements (i\U\j) which do not lie in the 
first row or the first column of the determinant on the left side of 
equation (39). In this way we get the equation 

(l\U\l)^(W-W,) (l\U\2) (1|C713) . . . =0. 

(21ir|l) (2\U\2)^(W^W,) ' 0 ... 

(3|C^|1) 0 (31^7|3)-(If~Tf3) . . . 


Expanding this bordered determinant we obtain 
{(l|C^ll)~(Tr~~TfJ}(F3~W)(W8--W)...^ 


vMJMra. 

3 * 




by means of a well known theorem on detefminants. Dividing >thiou^» 
out by the infinite product (W|— rearranging ihf 
terms, we obtain as our second approximation 


If = (40) 

M agreement with equation (26) except that no# W appem in jdaoe 
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of Wi in each denominator on the right-hand side. Equation (40) must 
be solved by the 'method of successive approximations, 

w=w,+m\i) 

being taken as a first approximation. 

The theory for degenerate states can be developed in precisely the 
same way. 

18.5. Rotating polar mokcnles in an electric field 
As an example of the application of the second-order perturbation 
theory to a particular problem, we consider the solution of the 
Schrodinger equation for rotating molecules subject to an electric field 
of strength F — a problem of some interest in the theory of dieleotrios.f 
If we treat the molecule as a rigid body symmetrical about an axis we 
may then write the wave equation in the form^ 

where I is the moment of inertia of the molecule about its axia of 
symmetry and p is the permanent electric moment of the molecule. 
We saw previously that the unperturbed wave equation (F = 0) has 
bounded solutions ^ c^P«(cosO)eH 

where is a constant and 

Moreover, if the wave functions ifs are to be normalized to unity we 
inust take 


\21+l/ • 

In the notation of the previous sections we have 

U = iiFoosd 
and 

' = J dr = J 

0 

No# it MJi be 8liown§ that the integral on the right vanishes utiIam 
j = *±I, and that 

(ficA«)i,-_i) = (,-i|cn»)|i) = p 

I w rt. ^ ^bye, Potof. MoUkan, ohsp. U (Leipdg, 192«). 
tef.C«»p.n.|14.»b^. iCf.App«dix. . 
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Thus (l\U\l) and we must use equation (26) to find the perturba- 
tion in Wi\ the series on the right-hand side of (26) leduoes simply to 
two terms to give 


W=^Wi-{ 







(z+ 1-f i 4 )(z+ 1 — -w) n 

1 ^ ^ 2.|/(2;-l)(2i+l) 

(Z+l)(2i+l)(2i+3)iJ 


where we have written K == 2IfiFlhK In particular the perturbation 
of the lowest energy level (ia = Z = 0) is 


TF = ~ 


j 


21 ‘ 6 


These results have been derived by Lennard-Jones using the determi- 
nantal method.t This method is also applicable in the case of large 
electric fields when K can no longer be assumed to be small. 


18.6. Wigner's perturbation method 

It often happens that the infinite series in equation (25) diverges in 
cases in which the lowest energy value itself is finite. In such cases 
the Rayleigh-Schrodinger method is, of course, inapplicable. Wigner{ 
has given an approximate formula for the lowest energy which can be 
proved to converge at least in certain simple cases. Suppose we wish 
to determine the energy of the lowest state of the system whose wave 
equation is (41) 

when the energy values and the wave-functions {n = 1, 2,...) of 
the equation with A = 0 are known. Assume a solution of (41) of the 

^ (42) 

for the lowest state. Using the result 

it is easily seen that for the wave function (42) 

(i|£r|i)+A2{(i|Pli)%+{ilc^|iK}+^*I X 

t J. E. Lefmar^-Jones, loc. oit., § 6. " 

► t E. Wigoer, Maih, u, natiurw, An*, der ungar, Ahad, d.^Wiss, 53, 477 (1936). 
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Similarly it is easily shown that 

f >li*Ho>l>dT= WJ+A* 

Adding the two integrals and writing 




we obtain for the energy of the perturbed system 


( 43 ) 


2 2 U\H,\k)afa, 

As 4* r— j^2k=2 

l+X^J^afaj 

Neglecting A® and higher powers of A we obtain 

Differentiating this equation with respect to oj and setting BW/da^ to 
zero we obtain the relation 


which may be solved to give 


Ot = 


W-E^ 


( 46 ) 


Substituting from equation ( 46 ) into equation ( 44 ) and neglecting 
powers of A higher than the third, we obtain the equation 

W-E, = A» j-w V (l|-Sil*)(*l^ilW|J?,|l) 

* i~(^-^k){W-E,) 

for the determination of the energy W. 

The value of W obtained from this equation is always too high. 
Contmumg this process Wigner obtains the equation 


2 ti 

where 


TiniW) = (A)^« y ... y aigiMfa|g,iM.)...(ii.-,iaiud 

This equation for IF ^ be solved by successive approximations. The 
senes ( 48 ) converges m many eases in which the Rayleigh-^ohrodinger 
perturbation theory le^ to divergent results. It can converge em 
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in the case of a continuous spectrum which has h©on made discrete by 
imposing a finite boundary on the problem (as in the bounded linear 
oscillator § 1 1.2, above). The series (48) was first obtained by Brillouin,! 
who obtained it by an intuitive consideration of the equations of the 
Schrodinger method. 

t L. Brillouin, Joum, d. Phys. 4; 1 (1933). 
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Similarly it is easily shown that 

f >li*Ho>l>dT= WJ+A* 

Adding the two integrals and writing 




we obtain for the energy of the perturbed system 


( 43 ) 


2 2 U\H,\k)afa, 

As 4* r— j^2k=2 

l+X^J^afaj 

Neglecting A® and higher powers of A we obtain 

Differentiating this equation with respect to oj and setting BW/da^ to 
zero we obtain the relation 


which may be solved to give 


Ot = 


W-E^ 


( 46 ) 


Substituting from equation ( 46 ) into equation ( 44 ) and neglecting 
powers of A higher than the third, we obtain the equation 

W-E, = A» j-w V (l|-Sil*)(*l^ilW|J?,|l) 

* i~(^-^k){W-E,) 

for the determination of the energy W. 

The value of W obtained from this equation is always too high. 
Contmumg this process Wigner obtains the equation 


2 ti 

where 


TiniW) = (A)^« y ... y aigiMfa|g,iM.)...(ii.-,iaiud 

This equation for IF ^ be solved by successive approximations. The 
senes ( 48 ) converges m many eases in which the Rayleigh-^ohrodinger 
perturbation theory le^ to divergent results. It can converge em 
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split by a field H in the ^-direction into a number of states, with energies 
differing from the original state by 

(jlHu, 

where is a quantum number having a number of discrete values. 
Then the force on an atom in the state u is 


The beam is thus split by an inhomogeneous field. The number of 
beams into which it is split gives the number of possible values of 
and the magnitude of the splitting gives the magnitude of fjLdHjdz. 

(b) The Zeeman effect. Emission or absorption lines in the line spectra 
of elements are frequently split by a magnetic field into a number of 
components. On the quantal interpretation this means that the energy 
levels are split. Unlike the Stern-Gerlach experiment, the Zeeman 
effect givfes information about the excited as well as the ground state. 

(c) Paramagnetic salts. The susceptibility of a salt containing N 

magnetic ions or atoms per unit volume is, if interaction between them 
can be neglected, N 2 ^ 

V hPZ, ' 


where the summation is over the v states of the ion. and u refer of 
course to the ground state of the ion. For the derivation of this formula, 
cf. Stoner, loc. cit. 

It will first be necessary to evaluate the angular momentum of an 
electron in a hydrogen-like atom. 


19.2. The orbital angular momentum 

In this section it will be shown that if an electron or other particle 
is described by a wave function 

<fi(x,y,z) = iJ(r)0(d)e<“^/V(2w), (1) 

where JZ(r), 0(^) are any normalized functions of r and 6, the com- 
ponent of the angular momentum along the axis of spherical polar 
coordinates is uH. 

In Newtonian mechanics the components of the angular momentum 
My, My, are given by 

( 2 ) 

= «^Pv-yPx- 
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Let US take the 2 -direction as the direction of spherical polar coordi- 
nates. Then we can show in an elementary way that the mean value 
Mg of Mg is nfi, for, by (2) 





dr. 


Transferring to polar coordinates this gives 


and if tjj is of the form (1) this reduces to 

Mg = uh. 


To show that Mg for a particle in the state corresponding to the 
wave function (1) actually has the value uh, we must make use of the 
transformation theory of § 16. We need a series of transformation 
functions gj[j^(x,y,z); then 

I / y< 2) (3) 

give the probability that has the value M'^. These transformation 

functions are defined by the equation 


-(x^ 



-K9 = ^-, 


in polar coordinates this gives 


i dcf) 


-M'gg = 0. 


The normalized solution is 


g == e^“'<A/V(27r), 

where if; = u% 

and the condition that g must be single-valued shows that u must be 
integral. 

Making use of (1), it will be seen that (3) vanishes unless u = u\ 
Thus if an electron is in a state corresponding to the wave function (1), 
the angular momentum has the value uh. 


19.3. The total angular momentum 

In thi8 section it will be sh6wn that if an electron is in a state with 
wave-function , 

i?(r)0(6l)e^«^/V(27r) (4) 

the total angular momentum is 
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The total angular momentum M is given in Newtonian mechanics by 
In quantum mechanics, M% is represented by the operator 


- - 4 ’£ 




-2yz 


dydz 


d d\ 


is therefore represented by the sum of three such terms. We have 
to transform to spherical polar coordinates. This can be done as follows: 
it is easily verified that 

— . /v' . n:' d 


-'■^■+(2*s)’+2*8x 

^x,y,0 ' x,y,z 

— ^2[--f2y2^(i.gjad)2-)-(rgrad)] 


= ¥ - 


[smt 






+ ; 


1 


sin^^ 

The transformation function <l>) is thus given by 

I d^g , 


1 

sini 


■(**1 


With the boundary conditions that g must be single-valued and bounded 
the solutions are P|*'(cos 

with = (^) 

and I a positive integer. It follows as in the last section that, for an 
electron in the state with wave function (4), (the square of the total 
angular momentum) has the value 


19 . 4 . Energy levels in a magnetic field 

We shall now calculate the energy levels of an electron in an atom 
when subjected to a magnetic field, without taking account of the spin. 
The Schrodinger equation is (cf. p. 40) . 

V«0+^®(Agrad0)+^(!f-F)^ = 0. 

The second term is to be treated as a ‘perturbation’, according to the 
/methods of the perturbation theory. 
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For the magnetic fields we take 

This has the vector potential 

= — Wy, Ay = \Hx, A^ = 0. 

Our perturbing term therefore becomes 



or, introducing polar coordinates with the s-axis as axis of coordinates, 

with w = eHl2mc, 

Since the wave functions of the electrons are of the form 
i?(f)0(0)e‘«^/V(277), 

it is clear that all matrix elements of U vanish except the diagonal 
ones, and that gw 

{u'\a\u") = iho^ f 

277 J d(l> 

0 

A state of azimuthal quantum number I is thus split into 21 + 1 states 
with energies displaced by 

^Hu (u=-l,-(l-l) 1). ( 6 ) 

This result was also obtained on the Bohr theory. It is easily verified 
on the classical theory that an electron revolving in an orbit with 
angular momentum I has magnetic moment eII2Tnc. If / is equal to 
Iky the magnetic moment is lekl2mc. If we now assume that the resolved 
part of the angular momentum alon^ the field is also quantized and 
equal touk{u= — Z,..., l—l,l)y then it follows that the resolved part of 
the magnetic moment is uekl2mc. Formula (6) for the energy change 
follows. 

A magnetic moment equal to e^/2mc is called a Bohr magneton and 
denoted by 

We see therefore that 8-states (I = 0) should not be split by a 
magnetic field, apart from the electronic spin to be considered below; 
p-states are split into three states separated by an energy interval fi^H. 

If an atom has two states with energies BJ, transitions between 
which give rise to a line of the emission spectrum of frequency vq given by 
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then according to the above theory the frequency of the line in a field 
H will be given by Vq+Av, where 


Av = 


eH 

^nrmc' 


Au, 


(7) 


A selection rule proved in Chapter X shows that = 0 or ±1- Thus 
any lines (except as transitions which are actually forbidden) should 
split into three lines separated by frequency intervals 

Splitting of this type is known as the Normal Zeeman Effect; it is 
actually shown only for singlet terms, e.g. parhelium. The formula for 
Av does not contain A, and should thus be derivable from the classical 
theory of a vibrating electron. The proof of this is instructive, and is 
given below. 

Suppose that an electron is held in position by a restoring force —pr 
when it is displaced a distance r. Then, in the absence of a field, it 
will vibrate in any direction with frequency v given by 


27rV ^ 

Suppose now a magnetic field H is introduced in the 2 -direction; the 
electron can still vibrate in this direction with its frequency unaltered, 
but the vibrations perpendicular to the field will clearly be affected. 
The normal modes of the vibration are thus circular orbits in a plane 
perpendicular to the field. If r is the radius of such an orbit, we have 
for the components of force on the electron 


restoring force — pr, 

centrifugal force wcoV, 
force due to field ±.flfea>r/c, 
where oj is the angular velocity (= 27tv), Thus 
p—m(M? = ^Hewjc, 

y /p p^ jj 

— A . 

m 2mc 

Formula (6) for the frequencies follows, as does information about the 
polarization of the three components (circular polarization of the outer 
components and disappearance of central component if the emitting gas 
is viewed parallel to the field, linear polarization of all three if the gas 
is viewed perpendicular to it). 


20. The electronic spin 

Many experimental facts have made necessary the introduction of 
the assumption of the electronic spin. These include the gyromagnetio 
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effect, the anomalous Zeeman effect, the results of the Stem-Gerlach 
experiment, and the existence of doublets in the spectra of the alkalis. 
In general it may be said that the number of states of the single-electron 
system (e.g. alkali, silver, or hydrogen atoms) is twice as great as it 
would be without spin. This is especially striking in the case of the 
ground state, which is an 5 -state {I — 0) and should therefore be non- 
degenerate. The original Stem-Gerlach experiment showed, however, 
a splitting of the ground state of silver into two, with a separation given 
by (eMIimc). On these grounds alone a spin must be introduced. 

We may obtain a formula for the component of angular momentum 
as follows. For electrons without spin we have deduced from the 
properties of the Legendre polynomial Pf (cos d) the following properties 
of the. orbital angular momentum: 

(a) The total angular momentum is ^{l(l+l)}fi. 

(b) The component of angular momentum along the axis of coordi- 
nates has the value hu where u has (2^+1) integral values. 

Since experiment shows a twofold splitting of the energy states due 
to the spin, then if s is the quantum number giving the angular 
momentum of the spin, it is reasonable to assume that (25-|-l) = 2 
and thus that « = J. Hence it follows that 

(a) The total angular momentum of a spinning electron is 

^{s(8+l)}h = iV3^. 

(b) The component of angular momentum in the direction of the 

field is where ^ 8,^±\ri, ' 

From the experimental point of view we may determine the com- 
ponent of angular momentum in the direction of a magnetic field H 
by means of the gyromagnetic effect. This measures the change in the 
. angular momentum of a bar of magnetic material when the direction 
of magnetization is reversed; the bar thereby receives a small impulse 
causing it to rotate about the field. If the magnetism is due to orbital 
motion and not to spins, we have for the ratio 
angular momentum __ 2mc 
.magnetic moment e ’ 

For spins the ratio is halved. 

The first experimental measurements of the gyromagnetic effect were 
carried out with ferromagnetic materials; we have here no a priori 
evidence as to whether the magnetism is due to orbital or spin motion. 
The observed values of the ratio (J.2mc/e) show that the magnetism is 
due to spin. 
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20 . 1 . Doublets of the alkali atoms and anornaUms Zeeman effect 

Owing to the spin, all states of the alkali-like atoms eiJ^cept the 5 -states 
are doublets; the 5-states in the absence of a magnetic field are not split, 
and so have a twofold degeneracy. This can be seen in a general way 
as follows: the electron, since it is moving in an electric field c/r^, is 
acted on by a magnetic field of order evfcr^. In this field the energy of 
the electronic spin is of order 

2mc 

which, if r is of the order of the radius of the hydrogen atom (h^jme^), 
gives a separation of the order 

— X total energy, 

or about one part in 10,000 of the total energy. In 5-states, however, 
there is no axis of symmetry for the motion and hence no change of 
energy due to the spin. 

Calculations of the doublet separation can be made (a) on the basis 
of the Pauli-Darwin theory of the next section; (b) on the basis of the 
Dirac theory of the electron (Chap. XI). The latter is, of course, the 
more complete theory and should give a correct result. 

The effect of the spin on the magnetic moment of an atom and hence 
on the Zeeman effect can also be worked out on the 
basis of either of the above theories, for a single 
electron moving in a central force. We give below 
a more elementary derivation due to Land6,t which 
is applicable also in a many-electron atom, where 
several electrons may combine together to give a total 
spin angular momentum Let also L be the total 
angular momentum due to orbital motion, and J the 
resultant angular momentum of the whole atom. 

Then the vectors )S, L, and J will form a triangle as 
shown in Fig. 16 . The component fi of the magnetic moment along J 
will be given by ^ 

fi = (2/S^cosjS+*^cosa), 



since the spin gives two Bohr magnetons for each quantum h of angular 
momentum. We thus have 

^ = ^Jg, 

2mc 


t A. Laiid6, Zeits.f. Phyeik, 19, 112 (1923). 
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where 




1+ 


ScOBp\ 


i 


The quantity g is called the Land4 g^-faotor. Since 


COSp = 


28J 


Putting in the quantum mechanical value for the angular momentum 
in terms of the quantum numbers, we find 


!/ = 


I s{8+l)+j{j+l)-l{l+l) 
2j(j+l) 


( 8 ) 


If the atom is now placed in a magnetic field H, the component of the 
angular momentum along the field will take up the following values 


giving 2j+l terms if j is integral or 2j if j is half an odd integer. In 
either case the energy interval between two adjacent terms is 


Alf = 


ehH 

2mc^' 


(9) 


21. Wave functions for the spin 

Wave functions describing the spin may be introduced in the follow- 
ing way. We take an arbitrary direction in space, for instance the 
«-axis of Cartesian coordinates. If a magnetic field H is appUed in 
this direction, an electron in an atom wiU be found to have changed 
its energy by ±ij,sH, where /ij, is the Bohr magneton (eS/2mc). Which 
value the energy should have is the only information that we may hope 
to obtain about the spin; we therefore introduce a variable < 7 ^, which 

can take only the values ±1 so that the energy of the electron in the 
field is 

is then the component of angular momentum along the field The 
complete wave function of an electron wiU then be 

c,) ((7,= ±l). 

The interpretation of this wave function is that 

-f i)PdT 

IS the chance that the electron described by the wave-funbtion 0 id in 
the volume element and that its spin would give an energy 
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if a field were set up in the zi-direction, with a similar interpretation 
if ( 7 ^ = —1. 

Since the interaction between the spin forces and the orbital motion 
of the electron is small, it is a good approximation to take 0(a;,y,2;; 
in the form //«» m n\ i \ / ii\ 

K= ±i). 

where il;(x,y,z) is the solution of a Schrodinger wave equation for an 
electron without spin, and x((^z) function of the spin coordinates 
only. 

In a magnetic field along the 2:-axis, the spin can have two stationary 
states. If the energy is —fijiH we shall denote the spin wave function 

X.W, 

where = 0- 

If the spin energy is we denote the corresponding wave function 

where xfiW = 0. 

Problems involving the spin are of two types: 


(а) Those involving the symmetry of the wave function in the many- 
body problem. These may be handled by the use of the wave- 
functions Xoii without the introduction of a wave equation. 

(б) Problems involving the interaction of the spin with a magnetic 
field; for these we need to set up a wave equation. 

In the next two sections, therefore, we will show how to build up 
a wave equation for the electron with spin. This equation was first 
obtained, by somewhat different methods, by Darwinf and Pauli. { 
It is a non-relativistic and therefore only an approximation to Dirac ^s 
relativistic equation (Chap. XI). It is nevertheless worth while to 
develop this approximate equation for the following reasons: no satis- 
factory relativistic equation has been given for the interaction of two 
electrons in an atom; also the more elementary theory still has to be 
applied to nuclear particles (e.g. the proton and heavier nuclei) which 
have mechanical and magnetic moments but certainly do not obey the 
Dirac equation. 

The extra term in the energy of an electron due to its spin should 
be, according to classical mechanics, 


(nH)+i((i,[Ev]). 

c 


t C. G. Darwin, Proc. Boy. Soc, A, 116, 227 (1927). 
t W. PbuU, ZeUt.f. Phytik, 43, 601 (1927). 


( 10 ) 
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Putting in the quantum mechanical value for the angular momentum 
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If the atom is now placed in a magnetic field H, the component of the 
angular momentum along the field will take up the following values 


giving 2j+l terms if j is integral or 2j if j is half an odd integer. In 
either case the energy interval between two adjacent terms is 
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21. Wave functions for the spin 

Wave functions describing the spin may be introduced in the follow- 
ing way. We take an arbitrary direction in space, for instance the 
«-axis of Cartesian coordinates. If a magnetic field H is appUed in 
this direction, an electron in an atom wiU be found to have changed 
its energy by ±ij,sH, where /ij, is the Bohr magneton (eS/2mc). Which 
value the energy should have is the only information that we may hope 
to obtain about the spin; we therefore introduce a variable < 7 ^, which 

can take only the values ±1 so that the energy of the electron in the 
field is 

is then the component of angular momentum along the field The 
complete wave function of an electron wiU then be 

c,) ((7,= ±l). 

The interpretation of this wave function is that 

-f i)PdT 

IS the chance that the electron described by the wave-funbtion 0 id in 
the volume element and that its spin would give an energy 
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This is easily proved as follows: the right-hand side of (12) may be 
written 

ijj qWnAM-iq'Wp', q')gA9') 'kk'- 3 ) 

But since the functions g'Jg') form a complete set of orthogonal 
functions, we can expand any function F(g^) in terms of them; thus 

^(?) = I ^n9n(9)> 

n 

where = J ^(9)9t(9) k- 

Thus F(q) = I (■ F{q')g*(q')g,(q) dq'. (14) 

Applying (14) to (13) we see that (13) is equal to 
/ 9*i9)LMg,^.{q)dq, 

or, in other words, to the left-hand side of (12), which is what we set 
out to prove. 

It will be seen from (14) that 

2s'n(3'k(?) = 0 (q¥^q')\ .jg. 

= 00 (? = g') 1 

and that the series behaves like the ‘8-function’, S(x) introduced by 
Dirac. This has the property that 

J f{x')S{x—x'} dx' =fix). 

A further theorem of importance, and the one that we shall use in 
the development of the theory of the spin, is the following; suppose 
a system has the Schrodinger equation 

H(p,q)ij(q)-W^l,(q)==0 (16) 

from which the allowed energy values W are determined. Then if g^iq) 
is any transformation function and 

x(»') = / qiWiQ) k, 

SO that lx(^')l* is the probability that, for the systeiA in the state 
the variable n has the value then (16) is equivalent to 

2(n'lHK)x(n^)-Prx(n') = 0. 


(17) 
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Using the Hennitian relation, that 

(iKHi) = (-iKiir, 

and substituting from (22), we find 
This gives for 

where a is an arbitrary phase. The value of this phase will not affect 
any quantities that we may have to compare with experiment, (t^ may 
be found in a similar way. 

It should also be noted that the three matrices together 

with the unit matrix 

form the basis of an algebra of 2x2 matrices; for any arbitrary matrix 
of this type 

[c d) 

can be written as a linear combination 



of <^i/i Jg. 

Furthermore, if we write Xa XjS matrix form we have 


Xa = Uh X^ = 


so that 


^xXoc- 




1 = 


In this way we may establish all of the relations 
<^xXcc = XP’ <’xXft = X<t 

°vXx = »X(3. OyXfl = -»X« • (23) 

^zXa. = X(«> °zX? = — Xj9 - 

21.3, Pauli's mve eqmiim for an electron with spin 
The wave equation can now be written down. For a point electron, 
for a magnetic fi^ld H and electrostatic potential energy F, it is 


-^VV+F^+^(Agrad0) = W<l,. 

The additional terms in the energy due to spin are (eq. (11)) 

-^(HS) + -i-,(S[Ev]). 

me mc^ ^ 


(24) 
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Replacing v by pjm and S by \hs we have to add lo the left-hand side 
of (24) ^ , H ' \ 

i/f is of course of the form o-'), and 

<r;=l.-l 

With this Hamiltonian we shall solve the following problem. 

Suppose that a beam of atoms in states is passed through an 
inhomogeneous magnetic field (Stem-Gerlach experiment), the field 
being in the zai-plane such that H^ — H cos 6, = H sin d. The beam 
will be separated into two by the field. Let the wave functions of the 
spins in these two states be 

= ^Xa.M+Sxfi{"z)> 

X'fi = 

Our problem is to find A and H, and hence the numbers of atoms that 
would be found parallel and antiparallel to a field if it were suddenly 
switched to the 2 -direction. 

The conditions that the two functions Xa should be normal 

and orthogonal gives 

A*A'+B*B' = 0, 


We thus set 


AA*+BB* = A'A^*+B'B'* = 1. 
A = cosAe^/^, B = sinAe*^ 


A' = —sinAeV, B' = cosAe^^ (26) 

Further, since ^a, x^ represent the two stationary states in the field H, 
the non-diagonal matrix element x*(H<^)x^ vanish. This gives 

{A*A'-B*B')H,+(A*B'+B*A')H^ = 0. 

Substituting from (26) we see that fi = v and 
sin 2A = Hp cos 2A. 

Thus tan 2A = HJH^ = tan S, 

and thus A = ^6, 

Thus the relative numbers of electrons that will point parallel and 
antiparallel to a field suddenly imposed along the 2 -axis are 

oos*J^, sin^J^. 
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22. Evaluation of the doublet separation in alkali atoms and the 
anomalous Zeeman effect 

The Schrodinger equation for a point electron in the absence of a 
magnetic field is ^2 


-^VV+F^= Iff 


(27) 


The perturbing terms due to a magnetic field H along the 2 :-axis and 
to the spin we 

For a given value of 1, there are solutions of (27) all with the 
same energy, and 2(22-f 1) solutions when the spin coordinate is intro- 
duced. These will be split into 2(2^+l) non-degenerate states by the 
perturbing term (28). Fortunately the 2(2/+ 1) unperturbed wave 
functions can be split into pairs iftf, such that the non-diagonal 
elements of U with respect to states in different pairs vanish. Each 
pair corresponds to a given value of the ^-component j„ of the total 
angular momentum ; the wave functions are 

For these = w+ J. It will easily be verified that elements of the type 

(j;ic;|j:)vanishifi;^j:. 

Neither nor is the correct wave function for our state, unless 
a strong field is applied sufficiently great to break down the spin-orbit 
interaction and prient the spin along the g-axis instead of along L\ 
our wave functions will be of the form 

and the contribution w to the energies will be given (cf. p. 73) by the 
quadratic equation 

(Pii-«’)(^!a-M>) = |t7i,p. (29) 

The matrix elements can be evaluated as follows: E is given by 

r, xdV 
r dr 


and thus 
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where M is the orbital angular momentum. Thus, if we write 

00 


CldV 




2mc^ J r dr 


{R[r)}h^dr=P, 


we have for the matrix elements of 


2mW 


(o[EgradJ) 


with respect to the wave functions 

pi u ^{i(i+i)^u(u+l)}\ 

\^{i(i+i)-n(u+i)} ~(w+i) r 

making use of the relations (19) for the matrix elements of M. 
Equation (29) thus becomes 

{w^liH(u+l)-Pu}{w^lxHu+P(u+l)} = P%l(l+l)-u(u+l)}. 

(30) 

If H vanishes, this gives 

w^-\-Pw = PH(l+l), 

whence w — PI or w — — P(^+l)- 


The doublet separation is thus 

(2^+l)P. 

Since P is of order l/c^, we do not expect a non -relativistic theory 
to give a quantitatively correct theory; compare the value given by 
the Dirac equation, Chap. XI, § 56.2. 

Turning now to. the shift of the levels given by a magnetic field, we 
see at once from (30) that a strong field > P) gives 
w — or iiHu. 

In this case the spin-orbital coupling is broken down (Paschen-Back 
effect). For weak fields 


l/^\ 


= (2w-f 1) 


w-\-P 

2w+P 


j— 


or 3 , 


l-V 


^ The Lande gr-factor is thus given by 

g = (l+l)l(l+i)i j = l+h 


It will easily be verified that the usual factor reduces to this form when 
« = J (i.e. for one electron). 
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23. Introduction 

In this chapter we consider the problem of two particles, moving in 
me another’s field and al6o in some external field. Problems in this 
category aret the hydrogen atom when the motion of the nucleus is 
taken into account, the helium atom and the hydrogen molecule, the 
scattering of cx-particles by helium nuclei. The most interesting features, 
of the problem arise when the two particles are identical, i.e. two elec- 
trons or two a-particles ; in this case we shall find that only half the mathe- 
matically possible solutions correspond to states that occur in nature.' 

We shall first develop the theory without taking into account the 
electronic spin. We shall suppose that we have two interacting particles, 
with massep m 2 , and that their coordinates are 

Then we have seen in § 8 that the state of the System will be defined by 
a wave function ^(ri Fg) of the coordinates of both particles, and that 
the interpretation of ^ is that 


is the probability that the first particle is at the point in the volume 
element and at the same time the second particle is in the volume 
element dr^ at the point Fg. The eqpiation satisfied by ^ is 

here P 2 p 2 p 2 


'i 


Vi = 4 I 

* d4^dyyB4’ 

V| = -£!4.-^4. 


and F(FiF 2 ) is tjie potential energy of the system when the first particle 
is at Fi and the second particle at Fg. 

f 

23.1. The hydrogen atom with motion of the ni^leua 
We denote by Fj, Fg the coordinates of the nucleus and electron 
respectively, and by m^y mg their masses. The potential energy of the 
system is where 

r = |ri-rr,l = V{(»'i-!e»)*+(yi-y,)*+(i!i-J!,)*} 
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is the distance between the electron>and the nucleus. To solve* equation 
(1) we introduce the coordinates of the centre of gravity of the system 

R = (%ri+mar2)/(mi+m2) 

and the distance between the particles 

r == Ti-Ta. 

The equation then transforms into 


where 


dt 2M r 

v2 == JL 4. 


and M = wii+wia, . m = ^1^2/(^1+1712). 

The equation is now separable. If we write 

0 = «Ai(R)Wr), 

^ is a solution of the equation if ^1, ^2 satisfy the equations 
8t 2M 


( 2 ) 


(3) 


ih 


, 




The first is the equation for the wave function of a free particle of 
mass M ; the solutions represent the motion of the hydrogen atom as 
a whole. The second equation is the SchrOdinger equation for the 
hydrogen atom; from it we deduce that the energy levels of the hydrogen 


atom are given by 


w = 

2ft* »*’ 


whew m is not the mass of the electron but 

The same result was obtained on the md quantum theory (§9). 


23.2. Rotational and vibraUoncU states of the diatomic molecule 
Suppose the SchrOdinger equation for the electrons in a diatomfc 
molecule to be solved for the case when the distance between the two 
nuclei is r. The energy of the electrons in the lowest quantized state 
will^be a function of r; we may denote it by W(t)* If we add to this 
the potential energy of the nuclei in one another’s field, we get an 
expiession for the total energy that the systemr would. hav|» if the 
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nuclei were held rigidly at rest at a distance r apArt. We denote this 
energy by F(r). For staT)le molecules we expect a curve with a minimum, 
as shown in Fig. 16. 



in considering the rotational and vibra- 
tional states of a molecule, and hence the 
motion of the nuclei, it may be shownf 
that it is a good approximation to treat 
the nuclei according to SchrOdinger’s equa- 
tion as moving under a mutual interaction 
with potential energy F(r). In that case 
the Schr6dinger equation is formally the 
same as that discussed in the last section, 
if wii, are the masses of the nuclei. 
The solution is in fact 


Fig. 16. Energy of a rnoleculo 

as a function of the inter-atomic ^l(^)02(^)» 

distance R. 


where ^i(R) describes the motion of the 
centre of gravity as before, and is of the form 


P|(cos0)eWi:(r)e-^^<'^ 

where L{r) satisfies 


( 4 ) 


r^dr 






0 . 


The tipproximation usually made in describing the levels is that for 
the lower vibrational states only the form of V near the minimum need 
be considered, and so V may be written 




In a crude approximation, assuming the amplitude of the vibrations to 
be small compared with we replace K(K+l)lr'^ by K(K+l)lrl. 
The energy values are then 


, 6 , 

The first term gives the vibrational ^ergy, the second the rotational 


energy. 

These energies are in general small compared with electronic energies, 
containing as they do the large nuclear mass m in the denominator 
eitlier as or m. The interval between the rotational levels is in 
general less than that between the vibrational. 


t M. Bom and J. R. Oppenheimer, Ann. d. Physik, 84, 467 (1927). 
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In the case where the diatomic molecule is built up of two atoms 
of the same kind, as for example in the hydrogen molecule Hg, the 
wave functions ^(rirg) have certain symmetry characteristics which, as 
we shall see in the next section, assume great importance. Suppose we 
interchange the positions of the two particles. Then R, the position 
of the centre of gravity, is unchanged, as is also r, the distance between 
them. By however, defined by 

cobB = {z^—Zz)/ry 

is changed to tt — B, and ^ is changed to 77+^. It will bo seen from the 
definitions of the spherical harmonics that P^(cob0)c^“^ and hence 
0 (ri Tg) changes sign if K is odd but is unchanged if K is even. Thus 
0(1*1 Tg) = 0(r2 Ti) (K even) , 

0(^1 ra) = -0(r2ri) (K odd). 

24. Symmetry characteristics of the wave functions 

It will now be shown that the symmetry characteristic proved above 
for the wave functions describing the positions of the nuclei of a 
molecule is a general property of the wave functions describing non- 
degenerate states, with quantized energy levels, of any atomic system 
consisting of two identical particles. We denote by the group of 
coordinates (a^i, (Ti) describing the position and spin orientation 
of the first particle, and by gg that of the second; if the particle has 
no spin, as for the a-particle for example, the a-coordinate is omitted. 
Then we shall show that for all non-degenerate states either 

0(?i 3'2) = 0(9^2 ?i)> 

or 0(?i?2) = — 0(g^2?i)- (®) 

States of the first type are called symmetrical, those of the second type 
anti-symmetrical. 

The proof is elementary; 0 satisfies a wave equation of the type 

WPiqv '^10(9^1 ?2) = (7) 

The Hamiltonian is necessarily symmetrical; therefore by inter- 
changing the suffixes 1 and 2 we see that 0(^2 ?i) solution of the 
same equation as 0(3'i?2)* ^^0 solution is non-degenerate; in other 

words, only one solution of the equation exists. Therefore 

0(?1?2) = ^0(?2?l). 

where is a numerical constant. Interchanging and q^ on the right- 
hahd side of this equation we obtain 

0(^8 5fi) = ^¥{9'f ?i)- 
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Thus = 1 

and ±1* 

We have thus proved that all solutions satisfy one or other of the 
relations (6). 

We shall now show that if a system is originally in a state described 
by a wave function of given symmetry it will remain in a state of that 
symmetry j that is to say, transitions between symmetrical and anti- 
symmetrical states are impossible. Again the proof is elementary. 
Whatever the perturbing influence to which the system is subjected, 
it must be represented by a symmetrical Hamiltonian; thus if the 
perturbing field is such that the potential energy of an electron in it 
is V(q)i the perturbing terms to be added to the Hamiltonian / are 
V(qi)-{-V{q 2 )- Thus the change with time of the wave function is 
determined by the equation 


where H is a symmetrical Hamiltonian containing the perturbing 
functions. 

It follows that if fft is initially symmetrical, the change that will 
occur in a time interval U is symmetrical; and if ifj is initially anti- 
symmetrical, so is 80. The symmetry of 0 therefore remains unchanged. 

To proceed further, we must appeal to experiment and point out the 
remarkable fact that in nature, in the energy spectrum of a system of 
two particles, only the energy levels corresponding to states of one 
symmetry are observed, for instance in the optical spectrum. For 
electrons and also for protons, both of which have a spin of p, only 
the anti-symmetrical states are observed; for a-particles, which have no 
spin, only the symmetrical states. We shall show below the evidence 
for this statement; we shall discuss first some of its consequences. 

First of all it follows that, in describing pairs of particles in the free 
state as well as in the bound state, we must use* a wave function of the 
required symmetry. If we did not, we could predict the formation of 
quantized states of the type which are not in fact observed. Suppose 
then we perform a measurement on one particle, the results of the 
measurement being describable by a wave function %(g). Suppose we 
perform another measurement on another particle, and describe the 
results of this measurement by v(q). Then we might think that the 
correct wave function to describe the system consisting of the pair of 


particles would be 





§ 24 SYMMETRY CHARACTERISTICS OF THE WAVE FUNCTIONS 107 

where qi is the coordinate of the first particle and of the second. 
We know, however, that this is not correct. If the particles are electrons 
the correct wave function must be 

'^{^Mq2)-n(q^)v{qi). ( 8 ) 

If they are a-particles the sign must be reversed, giving 

At first sight this is rather surprising, because having made the 
appropriate measurement we know that particle 1 is in the state u. 
It is therefore necessary to introduce a slightly different interpretation 
of the wave function gg) which describes the state of two identical 
particles. This interpretation is as follows: 

is equal to the probability that a particle has coordinates lying 
between q[ and q[+dq[ and the other particle has coordinates q^ lying 
between and g^+dgj. No attempt is made to state which particle is 
at which position, and the suffixes 1, 2 refer to two volume elements 
in space, rather than to the coordinates of two particles. 

With this interpretation it is obvious that |0(gig2)l^ must be a sym- 
metrical function of g^gg; and thus that our systen^ of two particles 
must be described either by the anti-symmetrical wave function (8) or 
by the corresponding symmetrical one (9). Thus, unless in our formula- 
tion of quantum mechanics we exclude either the symmetrical or the 
anti-symmetrical functions, the theory becomes ambiguous; we do not 
know whether to describe the "result of our two measurements by (8) 
or by (9). It is therefore necessary to exclude either symmetrical or 
anti-symmetrical functions; to determine which should be excluded for 
particles of a given type we still have to appeal to experiment. 

Particles which must be described by symmetrical wave functions 
are said to obey the Einstein-Bose statistics, and particles which must 
be described by an anti-symmetrical wave function to obey the Fermi- 
Birac statistics. 

For quantized systems, zero-order wave functions can be shown to 
be of the types (8) or (9) by applying perturbation theory as in § 17.3. 
Suppose, for example, that we consider two electrons in an atom. Then 
if the interaction between them is neglected or considered small, we 
can retain the concept of separate stationary states for the individual 
electrons. Suppose that these states have wave functions 0„(g). Then 
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a state in which one electron is in the state n and another in the state 
w' can be described by either of the two wave functions 


or 


both of which give the same energy. The state is therefore degenerate. 
When, however, the interaction between the two electrons is taken 
into account, the energy will in general split; if this occurs, the correct 
wave functions of zero order will be 




That this is the case follows from the proposition already proved, that 
solutions of the SchrOdinger equation for non-degenerate quantized 
energy levels must be symmetrical or anti-symmetrical. It also follows 
from the methods of § 17. If we set for the wave function 

and introduce a perturbing energy F(12), we find that the shift w in 
the energy is given by 

( 10 ) 

so that w = Va±Vi,y 

= / ^*(1)^* (2)F(12)^„(1)0„.(2) dq, dq„ 

Ti = J 0*(1M*.(2)F(12M„(2)^„.(1) dq^dq^. 

It will easily be seen that the two wave functions of zero order corre- 
sponding to these two values of w are those given by (8) and (9). 


24.1. Introduction of the spin coordinates 

The electron (and, as we shall see, the proton) has a spin of p and 
can thus be described by a wave function where takes the 
values ±1 only. For ease of notation the suffix z will be dropped in 
this section. 

If we introduce a strong magnetic field B which will break down 
any coupling between spin and orbital motion, or between one spin and 
another, then the wave functions describing the two stationary states 
of each electron will be the functions already introduced, Xa(^)> 
defined by 

= h XjjW 0 (a = 1), 

= ;c/,((T)=l ((T=-l). 
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A pair of electrons will then have four stationary states, described by 
wave functions / % / s x 

xMxpi<^z)-xMx^i^i) I* 

Xfi(^i)x^(<^2) J 

These states have energies 2/xF, 0, 0, The suffixes 1, 2 refer 

to the spin coordinates of the two particles; as already emphasized, no 
attempt must be made to associate either suffix with a particular 
particle. It will be seen that three of these functions are symmetrical 
and one anti-symmetrical. 

In the same way, if we are setting up wave functions for (e.g. nuclear) 
particles with spin sh, so that for each particle there are 2s -f 1 stationary 
states, there will be (2s+l)® states in all. Of these, in 25+1 both 
particles will be in the same state, and the wave function will be 
symmetrical. Of the remaining 25(25+ 1 ) states, half will be symmetrical 
and half anti-symmetrical. There are thus 


5(25+ 1 ) anti-symmetrical states 

(5 + 1 )(25 + 1 ) symmetrical states 


( 12 )^ 


When the interaction between the spins is taken into account, it 
may no longer be a good approximation to express the spin wave 
functions as products or sums of products of the form (11). Neverthe- 
less, the property proved above must persist; if 5^ is the angular 
momentum of each particle, the system of two particles will, for each 
orbital state, have (25+1)^ states of the spin, of which (5+l)(25+l) 
will be described by symmetrical wave functions Xs(^i^2) 5(25+1) 

by anti-symmetrical wave functions Xa{^i^ 2 )* 

When therefore we come to consider the stationary states with 
quantized energy values of two particles with spin, the position is as 
follows: first of all, the wave functions must be found for two particles 
without spin; these will be of the form ^^(rirg), ^^^('^ 1 ^ 2 )} some, denoted 
by the suffix Sf are symmetrical, others, denoted by the suffix A, anti- 
symmetrical. The complete wave function for the positional and spin 
coordinates may have either of the four forms 

i^^(^i^2)X8{^i^2) I gynametrical, (13) 

^A(^1^2)XAi^l^2) > 

^ 2 )Xa(^i ^ 2 ) I anti-symmetrical. ( 14) 
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The first two are symmetrical, the second two anti-symmetrical in the 
complete group of coordinates (r,(T) that describes the state of the 
particle. Thus either the first two types, or the second two types, must 
be excluded. To find out which, we must make use of the observed 
properties. 

We may anticipate the findings of the next section and state here that 
for protons and electrons only anti-symmetrical functions may be used, 
for a-particles and in general for nuclei with an even number of heavy 
particles (nucleons) as constituents, only symmetrical functions. 

We have seen that transitions between states symmetrical and anti- 
symmetrical in the whole group of coordinates (ra) associated with a 
particle are absolutely forbidden. Transitions between states of the 
two types, for example 

( 15 ) 

*pA(^1^2)Xs(^1^2h 

while not absolutely forbidden, have nevertheless very low intensities. 
The matrix element of any perturbing function F(riorj; vanishes 
with respect to these two functions; it will be shown in Chapter X that 
this matrix element determines the transition probabilities. To obtain 
a finite transition probability, therefore, more accurate wave functions 
than the approximate ones (15) must be used; in other words, the 
effect on the wave functions of the spin-orbit interaction must be taken 
into account. In particular, for nuclei, since the magnetic moment 
of the nucleus is about 1,000 times less than for the electron, the 
probability of these transitions is very small indeed. 

24.2. Application to diatomic molecules 

The considerations of this section are valid only for homonuclear 
mc^oules such as Hg in which the two nuclei are identical. If the 
element consists of a number of isotopes, they do not apply to the case 
where the two nuclei have different masses. 

We denote by r^, Fg the spatial coordinates of the two nuclei, and 
by oTj, (Tg their spin coordinates. The orbital wave functions ^(fiFg) 
have already been found; they are of the form 

i^Ki^i^2) = -PK(co8^)e^«^X(r), 

where (r,^,^) are the spherical polar coordinates of the vector Ti— tg. 
It has already been shown that is anti-symmetrical if X is odd, 
symAetrical if K is even. 

If then we limit ourselves to anti-symmetrical wave functions, we see 
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that for a given orbital state the spin wave functions will give rise to 
(5+1)(25+1) states if Z is odd, 5(25+1) if K is even. If we are limited 
to symmetrical wave functions, these numbers are reversed. Let us 
then consider transitions between two electronic states of a diatomic 
molecule; the energy levels corresponding to the two states will be, 
according to formula (6), 

Wi + Avj ( w + J ) + + 1 )/27i , 

where /j, I^, are the moments of inertia of the two states. The usual 
selection rule for diatomic molecules is 


= 0 or ±1, 


but for homonuclear transitions K—K‘ == +1 is excluded by the argu- 
ments of the last section. Therefore the rotational structure of a band 
is given by the formula 


Jiv = hv^+K{K-^\) 


.2/i 



( 16 ) 


Also, if a gas containing diatomic molecules is in thermal equilibrium, 
there will be differing numbers of molecules in states with odd and 
even 7 l, the numbers being in the ratio 5:(5+l) or (5+l):5. Further, 
this ratio will not be altered when the molecule is excited by electronic 
bombardment or in any other way, since the probability of transition 
from odd to even states is so small. Thus, finally, we see that the 
intensities of the lines given by formula (16) should alternate, odd lines 
and even lines having intensities in the ratio 5/(5+!) or (5+l)/5. 

Turning now to the results of observations on diatomic molecules, 
it is found that for the molecules He2, C2, and O2 the lines with odd K 
are not observed; therefore these nuclei must have no spin and obey 
Bose-Einstein statistics (symmetrical wave functions). Also for Ng the 
odd states have half the intensity that the even states have. There- 
fore the nitrogen nucleus has a spin of unity (5=1) and again obeys 
the Bose-Einstein statistics. 

It should be pointed out that these relations hold only when the 
molecule is formed from the common isotope, e.g. He4, Cja, or 0 ^^, No 
such alternation of intensity is expected or observed for molecules of the 
type O10O17, for example (for which the rotational levels are slightly 
displaced on account of the changed moment of inertia). f 

It is significant that nuclei of even atomic weight always obey Bpse- 


t Cf. R. T, Birge and A. S. King, Phya. Rev, 34, 376 (1929). 
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Einstein statistics, as far as known. This is to be expected if they are 
built up of heavy particles (protons and neutrons) only. 

For hydrogen, lines corresponding both to odd and to even values 
of K occur; but lines for which K is odd have three times the intensity 
of the lines for which K is even. This shows that hydrogen nuclei 
(protons) obey Fermi-Dirac statistics (anti-symmetrical wave functions) 
and have a spin of one-half quantum (sh where « == ^). 

Kesults for many other nuclei can be found in any book on nuclear 
physics, for example the review by Seaborg, Eev. Mod. Phys. 16, 1 
( 1944 ). 

The arguments given above show that transitions between the two 
forms of hydrogen (those with odd and even values of K) will proceed 
at a very slow rate. This means that if hydrogen gas in equilibrium 
at some temperature is heated or cooled to some new temperature, it 
will take a very long time to reach equilibrium. This gives rise to 
anomalies in the specific heat of hydrogen gas at low temperatures. 
The two forms of hydrogen (K even, K odd) are called para- and ortho- 
hydrogen. The one form can only be changed into the other by a 
catalyst which dissociates the molecule and allows the atoms to recom- 
bine after being mixed up.f 

24.3. The helium atom: general principles 

Considering first the orbital motion, and hence the wave functions 
for the spatial coordinates, we expect to have a series of symmetrical 
wave functions with associated energies and anti-sym- 
metrical wave functions with energies W^. The ground state 

will be a symmetrical function. Including spin coordinates, the anti- 
symmetrical functions are of the type 

one spin State, | 
three spin states. / 

If electrons obey Fermi-Dirac statistics (as they do!) we expect there- 
fore to find in helium two sets of energy levels, between which optical 
transitions are extremely weak. One set of these, known as the par- 
helium levels, will have symmetrical orbital wave functions and will 
thus include the ground state; these states have zero spin moment 
(anti-parallel spins) and are not split by the spin. The other set (ortho- 
helium) with anti-symmetrical levels will be split into three states by 
the spm and thus give a triplet series. 

f Of. for example, R. H. Fowler, Statistical Mechanise, 2nd ed. (Cambridge, 1936), 
p. 860. 
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The spin wave functions (17) are wave functions for etatiomry states 
only if a strong external magnetic field is applied, sufficiently great to 
break down the spin-orbital coupling and produce a Paschen-Back 
effect. The spin wave functions in the absence of an external field will 
be combinations of these wave functions. For a mathematical treat- 
ment of this case, a derivation of the triplet separation and of the 
Land6 gr-factor, cf. for example H. Bethe, Handb. d. Physik, 24/1, 
p. 393 (1933). 

Methods of obtaining the orbital or spatial wave functions are given 
in Chapter VI. 

24.4. Collisions between identical f articles 

The symmetry characteristics of the wave function affect the in- 
tensity of scattering when one particle collides with another of identical 
type, for example when a-particles are scattered by helium gas. The 
wave equation for the two particles 
(equation (2)) can as before be solved 
in the form 

where R is the coordinate of the 
centre of gravity and r is the vector 
joining the position of the two par- 
ticles. If we refer the system to 
moving axes with respect to which 
the centre of gravity is at rest, 

^i(R) is a constant; we thus consider two streams of particles impinging 
with equal and opposite velocities, as in Fig. 17. The function 
satisfies the differential equation 

where V(r) is the interaction energy and m* is the ‘reduced mass’, in 
this case half the mass of either particle. As will be shown in Chapter 
IX, a solution ^(r^rg) can be obtoied which for large r has the form 

ikf 

/(O). (18) 

T 

With this unsymmetrical wave function we should interpret 1/(^)1^ as 
the number per unit solid angle scattered from beam A and [/(tt— ^)|* 
as the number scattered in the same direction from beam B. The total 
intensity of scattering would thus be 

wn I 



Fig. 17. Collision between identical 
particles. 


( 19 ) 
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The function (18), however, does not have the desired symmetry. 
Suppose, for instance, that we are dealing with a-particles; for these 
the spin is zero and the wave function must be symmetrical. Since 
an interchange of the coordinates of the two particles changes z into 
—z and 6 into tt—O, the correct wave function is 

cikr 

~ 2co8^s+— [/(0)+/(ir-0)], 

and the total intensity of scattering in the direction 6 is, instead of (19), 
\f(e)+f(n-d)\\ ( 20 ) 

There is no possibility, theoretical or experimental, of distinguishing 
between knocked-on and scattered particles; (20) gives the total 
number. 

It is particularly to be observed that when 6 = 90° formula (20) 
gives just twice as many scattered particles as the unsymmetrical wave 
function (19). 

If the velocity of the particles is not too high, the electrostatic 
repulsion between the particles alone is responsible for the scattering. 
Under these conditions, as shown in Chapter IX, |/(^)|^ reduces to the 
form given by classical mechanics, namely 

and thus the quantum mechanical formula (20) gives just twice the 
classical value at 90°. 

In order to verify this prediction of quantum mechanics that for 
identical particles without spin the scattering would be twice the 
classical, experiments were carried out by Chadwick, f who bombarded 
helium with ^-particles; it is then to be expected that twice the classical 
scattering will occur at 46° (since in the actual system the centre of 
gravity is not at rest). In Fig. 18 a is shown the ratio of the observed 
number of scattered particles between 40° and 60° to the number to 
be expected classically. It will be seen that at low velocities this ratio 
tends to 2; at higher velocities the scattering is determined by the 
structure of the a-particle. 

For Coulomb scattering the function /(^) has the fonuj 
Ze* r 

cosec^ exp log( 1 ~ cos 6) + const. 

The phase factor, which contains Planck’s constant h and is therefore 

f J. Chadwick, Proc. Roy, 8oc, A, 128, 114 (1930). 
t Cf. Mott and Massey, Theoty of Atomic Collmonc (Oxford, 1933), p. 35, 
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a consequence typically of quantum mechanics, does not affect the 
collision between particles that are not identical. As is evident from 



Scattering oF a-parbicles in Helium between 40®and 50" 


Fio, 18 a. Ratio of observed scattering of a^particles in helium to classical Coulomb 
scattering. E is the energy of the particle, and that of the original particle from a 
polonium source. 



Fig. 186. Theoretical value of ratio shown in Fig. 18o. Curve 1, v = 8x 10®, 
curve 2, = 4x 10®, curve 3, v = 2x 10® cm./sec. 

formula (20), however, it does do so in the case of identical particles; 
the ratio of the scattering given by (20) and by the (classical) formula 
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(19) is plotted in Fig. 186 for values of the velocity equal to 2x 10®, 
4 X 10® and 8 x 10® cm. per sec. The smaller the velocity, the more rapid 
are the oscillations. 

In order to verify the existence of these peaks a detailed statistical 
investigation was made of the scattering of a-particles by helium in an 
expansion chamber by Blackett and Champion. f The results fully 
confirmed the existence of the minimum round about 35° for t; ^ 2 x 10® 
cm. /sec. 

If the colliding particles have spin (protons or electrons) the treat- 
ment is rather more complicated. In this chapter a discussion will be 
given only of the case where the particles have spin J (angular momen- 
tum \%) and obey Fermi-Dirac statistics. 

If the particles have opposite spins, they may be treated as distin- 
guishable; the number of particles scattered in the direction d will be 
given by (19), namely 

( 21 ) 


If they have the same spin, the wave function in the spatial coordinates 
must be anti-symmetrical; the number of particles scattered will there- 


fore be 


\f(d)^f(n^e)\^. 


( 22 ) 


Taking the mean of the two, the scattered intensity is 


aimn m^-e)n 

This formula may be obtained in a slightly different way. Suppose 
that a measurement is made which shows that one electron is in the 
neighbourhood of the point A, with wave function ^^(r) and with spin 
wave function Xa(®)j second electron has wave functions ^^(r), 

Then the wave function describing the pair of particles is 


If the spins are in the same direction (a = ^8), then it follows that the 
spatial wave function is anti-symmetrical and that the scattering is 
given by (22). If they are in opposite directions it may be written 

t P. M. S. Blackett and F. C. Champion, Proc. Roy, Soc, A, 130, 380 (1931). 
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From this it follows that the scattering is given by 
which is identical with (21). 


24.5. The Pauli principle 

The Pauli principle, as originally formulated, stated that no two 
electrons in an atom could have the same set of four quantum numbers. 

The Pauli principle follows from the more general statement that the 
wave function if/(T^cr^, Tgo-g, rgorg...) of all the electrons in an atom, or 
in any other system, must be anti-symmetrical in the coordinates of any 
two electrons. When the interaction between the electrons is small 
enough for it to be possible to represent the orbital wave function by 
a product of functions of the coordinates of individual particles, the 
Pauh principle results. This may be seen as follows. 

It is simplest to consider an atom in a strong magnetic field suffi- 
cient to produce a Paschen-Back effect, so that there is no coupling 
between the spin and the orbital motion. Suppose that the n individual 
electrons are in orbital states a, 6, c,..., n with wave functions 
0ft(r).... If the total spin along the field is shy this means that 
spins are oriented along the field and \n~8 in the other direction. 
Thus there are n\j{\n--s)\(\n+s)\ ways in which a spin wave function 
Xa(cr) or can be associated with an orbital wave function 
Let the series of functions 


^a(r)Xa(«^)> - (23 a) 

represent one of these ways; the suffixes a, by c, must when appended 
to X denote a or Then, as long as none of these wave functions is 
the same, we can form an anti-symmetrical wave function in the form 
of the determinant 


M^l)Xa{^l) M^2)Xai^2) • • • 

M^l)XbM M^2)Xb{<^2) . . . . 


(236) 


If, however, any of the individual wave functions (23 a) are the same, 
(23 6) vanishes and no anti-symmetrical wave function is possible. This 
proves the Pauli principle. 

The function (236) is not the correct zero-order wave function of the 
system; to obtain this we must add together all of the n!/(|w~a)!(in4-5)l 
functions of the type (23 6) which do not vanish, with coefficients which 
havb to be determined. Methods by which this cah be done are 
described in Chapter VI. 
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25. The helium atom: methods of obtaining wave functions 

In this section we discuss only the wave functions in the spatial 
coordinates; we have therefore to solve the SchrOdinger equation 

( 2 .) 

As has been shown already, we expect to obtain a series of symmetrical 
wave functions fg) with corresponding energy values, which with 
the introduction of the spin remain singlet terms, and a series of anti- 
symmetrical wave functions ^^{rirg), which become triplet terms. 

25.1. Perturbation method of obtaining energy levels 
In order to obtain approximate values for the energy levels it is 
convenient to consider both electrons moving in some field F(r); the 
Schrddinger equation 

^¥+^{W^-F(r)}^ = 0 (25) 

is then solved, and characteristic energies and wave functions 
obtained; wave functions of zero order for the helium atom are then 
taken to be , 

with energy 1^+1^'; and the remaining term in the potential energy 
in (24), 2^2 0-2 p 2 

is treated as a perturbation and the change in the energy is calculated 
by the methods of Chapter III. In his original paper on the helium 
atom, Heisenbergf took for Y(r) 

l^(r) = e^lr^ (r < r^) 

I = c2/r (r > ro), 
where r^ is some suitably- chosen radius; clearly each electron will move 
in the field of a doubly charged nucleus for small r and that of a singly 
charged nucleus for large r. This form of F(r) is not, however, the best 
possible choice (cf. § 25.2). 

The change in the energy due to the perturbing term ^/^(rira) is then 
(we write ^(1) for ^(r^)) 

t w. Heisenberg, ZeUa.J. Phyaik, 39, 499 (1927). 


r 
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which reduces to where 

•4«- = If ^(rir,)l^jm,.(2)l^dT,dT, 
and Z„„, = JJ U(t^r,)Um(l)Ummdr^dT,. 

The integral K is called the ‘exchange’ integral. The ground state 
of helium has the wave function of zero order 0 q(1)^o( 2) and there is 
thus no splitting of the energy value when interaction between the 
electrons is taken into account; when, however, one electron is excited 
the pair of wave functions of zero order 

are possible, with energies to the first order 

The terms with the positive sign form the series of levels (singlets) which 
include the ground state. 

25.2. Choice of the field V(r); the Hartree method of the selficmsistent 
field 

It has been shown by Hartreef that the best field V(r) for the calcula- 
tion of the wave functions of zero order, at any rate for the ground 
state of helium, may be obtained as follows: if 0(r) is the wave function 
for the ground state of an electron moving in the field F(r), and thus 
the solution of (25), V(r) is given by 

F(r) = J (26) 

This field is termed the self-consistent field; it can only be obtained by 
a series of successive approximations, since ^ cannot be calculated until 
an estimate of F is made, which must then be modified using the value 
of ift obtained. 

It will be seen that F(r) is the potential energy of an electron in the 
field due to the nucleus and to a charge distribution (due to the other 
electron) of density — e|^(r)|^ at any point at a distance r from the 
nucleus. 

The self-consistent equation is thus 

VV+|^{ir-F(r)}^ = 0. 

t D. B. Hwtree, Proc. Comb. Phil. Soc. 24, 89 (1928). 
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It will be noticed that W or 2 fV is not directly related either to the ioniza- 
tion potential or to the total energy of the atoms. The total energy 
of both electrons (minus the work required to remove them both) is 

It is important to realize that the Hartree approximation does not 
entirely neglect the influence of one electron on another; for instance, 
in helium the field V(r) behaves as —2e^lr near the nucleus and as 
— e2/r at large distances, owing to the screening effect of the other 
electron. On the other hand, if one electron is on the right-hand side 
of the atom, the other electron is more likely to be on the left than on 
the right because of the repulsion between them. This fact is neglected 
in Hartree ’s approximation; to take it into account terms involving 
Irj— Fgl must be introduced into the wave function. 

The proof that the self-consistent field gives the best approximation 
for the ground state requires the following lemma: 

Consider a Hamiltonian function from which a SchrO- 

dinger equation can be set up. Let ...) be any function of the space 
coordinates q satisfying the normalizing condition 

J 0*^ dq = 1. (27) 

Then, if the integral ^ ~ J (28) 

is formed, the lowest energy value Wq corresponding to a characteristic 
solution of the SchrOdinger equation 

(H-W)ilf = 0 

is equal to the lowest possible value of the integral (28) for all possible 
choices of ^ which obey (27). 

The proof is as follows: suppose we vary ^ by adding to it a small 
arbitrary function 8^ of q. Then since the integral (28) is a minimum, 
the change in its value will be zero to the first order. Thus 

J dq+ J ^dq=0. 

By the Hermitian property (cf. Chap. I) the two integrals are equal; 
we thus obtain 

This is valid for all functions 8^ such that, by (27), 


= 0 . 
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Using the method of undetermined multipliers, we see this is the case 
for a certain choice of a parameter A if 

J = 0 

for all functions 8^*. It follows that 


Htp—Xilt = 0 , 

which gives us just the Schrodinger equation. 

The other solutions of the Schrodinger equation also give minimum 
values of (28), i/r being varied in such a way as to keep the number of 
nodes or nodal surfaces in the wave function a constant. 

The self-consistent or Hartree solution can now be shown to be the 
one which gives the minimum energy, and hence the best approxima- 
tion to the wave functions when (in the case of an atom with two 
electrons) the wave function ?/>(ri, Fg) is taken in the form of a simple 

mmy 

For if we substitute (29) in (28) above and vary 0 by 80 we obtain 


for any variations of 0(r) which satisfy 

J 0(r) 80(r) dr = 0. 

Using as before the method of undetermined multipliers, we obtain the 
following differential equation to determine 0(r): 

(J = 0. (30) 

The term in the curly brackets gives 

J ^(2)fr^(2)rfr,= -^Vf+F(ri)-U, 

where V(r) is defined by (26) and 


A = 



Equation ^0) thus becomes 




where w = W—A. This is Hartree ’s self-consistent equation for tfi. 
It 'will be noted that the total energy of the two electrons is given by 
w = W—A. 
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25.3. The Fock equation 

If, in the approximation of zero order, the two electrons are not in 
the same state and thus if an excited state of the helium atom is under 
consideration, the wave functions derived by Hartree’s method do not 
give the best possible approximation. If the wave functions concerned 
are 0o(r), ijjjr), Hartree's method would consist in making the integral 

a minimum. A better value for the energy will be obtained by using 
symmetrical or anti-symmetrical wave functions for the terms of ortho- 
and parhelium respectively and minimizing the integral 

This was first pointed out by Fock,f and the resulting equations are 
called Fock’s equations. These equations are, as may easily be shown 
by the methods of the last paragraph, 

where 

As in Hartree’s equations, a solution can only be obtained by a method 
of successive approximations. An attempt to solve the equations for 
the excited s-states of helium has been given by L. P. Smith.J 
Further consideration of Hartree’s and Fock’s methods for atoms 
of more than two electrons is given in the next chapter. 


26. The deuterpn 

A two-body problem of some importance in theoretical nuclear 
physics is that of determining the energy levels of the deuteron. The 
deuteron, consisting as it does of two elementary nuclear particles (or 


aI Se® J. c. Slater, loo. cit., and BriUouin, 

1934). who gives a detailed account of the two equations 
and of the relation between them. 

t L: P. Smith, Phya. Rev. 42, 176 ( 
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nucleons), one proton and one neutron, plays in nuclear physics a role 
similar to that assumed by the hydrogen atom in atomic physics. 
Further, since any two-body problem can be integrated explicitly if 
the force between the two particles is a known function of their distance 
apart, calculations on the deuteron are more suitable for quantitative 
comparison with the experimental results than any others in nuclear 
theory. 

In order to deal with the two kinds of nucleons it is necessary to 
introduce a convenient formalism; this was first achieved by Heisen- 
berg, f who introduced the isotopic spin variable t analogous to the spin 
variable s. Just as in the latter case 5=1 represents a particle with 
a spin in the direction of the s-axis and 5 = — 1 a particle with spin 
in the opposite direction, t = 1 represents a neutron and ^ — 1 

represents a proton. In precisely the same way there are isotopic spin 
operators t = (t^, t^) corresponding to the Pauli spin operators 

® ^z) defined above. 

The wave function of the deuteron is thus a function not only of 
the space coordinates of the two particles but also of the spin 

coordinates 5<^^, 5 ^^ ^and the isotopic spin coordinates Thus 

ijj = 5 ( 1 ), P\ r<2), 5<2), ^2)). (31) 

For the deuteron the wave function ^ may be written as a product 
ijf = r<2));j^(5(i), 5<2); P\ (32) 

where ^ is a function of the space variables and x ^ function of the 
spin and isotopic spin variables. For nuclei containing more than two 
nucleons it is not possible to separate the space coordinates from the 
spin and isotopic spin variables in this way. 

To fulfil the Pauli exclusion principle iff must be an anti -symmetrical 
function. Thus either ^ is symmetrical and x is anti-symmetrical or ^ is 
anti-symmetrical and x is symmetrical. In the ground state we would 
expect the former of these conditions to hold. If we denote the spin 
wave functions by Xai XjS* above, and the corresponding isotopic 
spin functions by x^, x^, then for the triplet state we have 

xl’-S) = Xa. x«.(x«, Xj8,-Xj5. Xa.). (33) 

and for the singlet state 

x(‘'S) = (x«^X|J.-Xft x«.)(x«. Xj5.+Xj5, Xa,)- (34) 

Using the results (23) of Chapter IV we have 

‘»‘“x«»= (X|3..+»Xj},.Xa.). 
f W. Heisenberg, Zeits.f. Physik, 77, 1 (1932). 
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and similarly for 

(o<«.o(»)x„.X(«. = = XcciXa.- (35) 

The t’s and x’s satisfy the same relations as the (t’s and x’s, i.e. 

’•*X« = X/3. T-yX«=+»Xp. 'rzXoc = X<x 

T®X/9 = Xa. '^uXp = -»Xa. T^XjS = -X/3 
SO that (t<‘>.t<»)(x„,x^.-X^.xJ = -3(Xa,-Xft-Xj3.-xJ- (3'^) 

Now according to the Moller-Rosenfeld meson theoryf of nuclear 
forces the interaction between two nucleons is given by 

J7= (T<«.T<»)[C?*+(«»'i>.o'®)i^^— (38) 

^12 

where r ^2 denotes the distance between the two particles, F and G are 
two constants with the dimension of electric charge, and A is a constant 
related to the mass, w, of the meson by the formula 




If M denotes the mass of a nuclear particle, then transforming the 
SchrOdinger equation to coordinates in which the centre of gravity is 


at rest, we obtain 


AW-U)^: 


as the wave equation for the deuteron. 

For the triplet state we have from equations (35) and (37) that 

T 

and similarly for the singlet state it can be shown that 

Thus in both cases we may write for the state (I = 0) of the deuteron 
Writing 

r = plX, B = 6|2jfic, W^al^^mc^, ^ = ® 
we obtain the equation 


• |a-f-6— jo = 0. 


f C. Heller and L, Rosenfeld, D, Kgl. Danske Vidensk, Stlslcabt Mek.-fys. Medd, 17, 
Ns. 8 (1940). 
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The differential equation (41) satisfied by the radial part of the wave 
function for the ground state of the deuteron has been studied in detail 
by A. H. Wilson.*!* The problem now is to determine a when b is known. 
Assuming a solution of equation (41) of the form 

0 = pe-i% (42) 

so that d^ldp^ is equal to we have to determine the energy 



0 0 


00 00 

Now J 4)*<D dp = 1, I -~(^*dp : 


0 0 
so that we obtain the relation 


(l+a)2’ 


a = — - 


boL^ 




2(l+ocf 

To obtain the best value of a we minimize a with respect to a. In this 
way we obtain the parametric equations 


a = 


a^(a — 1) 


6 = - 


(oc+l? 

a(a-t-3)^ 


(43) 


giving the relation between a and b, Wilson compares this solution 
with that obtained mechanically by means of a differential analyser. 
The results are summarized in the first three columns of the table below. 


a 

b 

Analyser 

Wilson 

Hulth^n 

000 

- 1*70 

- 2-00 

- 1-68 

-- 0-26 

- 2-73 

- 2*81 

- 2*77 

- 0-50 

-316 

- 3-23 

- 3-20 

- 0-76 

- 3*48 

- 3-64 

- 3-52 

- 1-00 

- 3-76 

- 3-83 

-382 


The second column gives the value of 6, corresponding to the value 
of a on the left, as determined by the differential analyser. The third 
column was constructed by calculating b from equation (43) for the 
value of a necessary to give the prescribed value of a. It will be observed 
that these values of 6, lie close to each other except in the neighbour- 
hood of a = 0. Wilson also computed the relations between a and b 
arising from taking a wave function of the form 

t A. H. Wilson, Proc. Camb. Phil 8oc. 34, 366 (1938). 
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containing two parameters a, j3; but he found there was no marked 
improvement on that obtained from the wave function (42). 

More recently Hulth^nj has employed an approximate wave function 
of the form o = (44) 


where y is a parameter. The values of h resulting from this wave 
function are given in the final column of the table. The agreement with 
the exact solution obtained from the differential analyser is very close 
in this case (cf. Fig. 1 of a paper by FrOhlich, Ramsey, and Sneddon). J 
In calculating the binding energies of more complicated nuclei 
Hulthfen’s wave function (44) is not much more accurate than Wilson’s 
approximation (42) and the latter is easy to handle analytically. By 
means of these calculations and a similar one for it is possible to 
calculate the constants F, Gy m of the MoUer-Rosenfeld theory, § and 
to show that it leads to results in agreement with the observed angular 
dependence of the neutron-proton scattering. J The value of the meson 
mass obtained in this way is about 22(^ electron masses, while cosmic- 
ray observations give a mean value of ibout 200 electron masses.!! 


t L. Hulth^n, Arkivf. mat. astr. o.fya. 28, Nol 6 (1942). 

X H, Fr6hlich, W. Ramsey, ^id I. N. Sneddon, Rep. Int. Cong, on Fundamental 
PartkleHy Cambridge (1946), p. l66. 

§ H. Frdhlioh, K. Huang, and I. N. Sneddon, Proc. Roy. Soc. A, 191, 61 (1947). 
j] W. F. G, Swann, Reports Prog. Physics^ 10, 46 (1944-.5). 



VI 

ATOMS WITH MANY ELECTRONS 

Introduction 

In the last chapter we discussed the quantum mechanics of monatomic 
systems next in order of complexity to the hydrogen atom, namely 
those in which two electrons move in the field of a heavy nucleus. 
We shall now consider some of the methods employed to extend the 
analysis to more complex monatomic systems, in which more than two 
planetary electrons revolve round a heavy nucleus. One of the most 
important methods of attack on this problem is omitted here because 
it requires a knowledge of the theory of groups. Accounts of the 
applications of the theory of groups to the discussion of complex atomic 
spectraf should be consulted for details of this method — one of the 
most powerful for the derivation of general results. 

We shall be concerned here mainly with an account of the various 
methods of calculating the energy levels and wave functions of complex 
atoms. In many problems a knowledge of the wave functions of complex 
atoms is necessary for the determination of physical quantities such 
as the radial charge density; an account is given of methods of obtaining 
the wave functions numerically and of approximating to these wave 
functions by means of analytical expressions. 

27. The wave functions of many-electron systems 
27.1. The Schrodinger equation in atomic units 
In order to avoid the frequent repetition of the constant 2mjP it is 
convenient to express the SchrOdinger equation in the atomic units 
first introduced by Hartree. J In this system of units the unit of length 
is chosen to be the radius of the lowest orbit in the Bohr theory of the 
hydrogen atom, Tq — ¥]me^. Taking the unit of charge to be the 
electronic charge, e = 4-8025 x 10“^® e.s.u., and the unit of mass to be 
the rest mass, m, of the electron (9-1067 x 10“^® gm.), we find that has 
the value 0*629 x 10“® cm. 

From these quantities we derive the unit of energy 

€ = e®/rQ = me^Jh^ = 4-364 x 10~^^ ergs = 27-20 eV, 

t See, for example, H. Weyl, Qruppentheorie und QuarUenmechanik (Leipzig, 1928; 
Eng. trans. London, 1931); C. Eckart, Rev, Mod, Phya, 2, 305 (1930); E. Wigner, 
Oruppeniheorie (Braunschweig, 1931) ; B. van der Waerden, Qrujyjientheoretiache Methode 
in der Qtumtenmechanik (Berlin, 1932). 
t D. B. Hartree, Proc, Camb. Phil. Soc. 24, 89 (1928). 
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The energy c is twice the ionization potential of the hydrogen atom in 
the ground state. The unit of velocity is automatically fixed by the 
relation JmvJ = c once the units of energy and mass are fixed. It turns 
out to be the velocity of the electron in the ground state of the hydrogen 
atom (again, in the Bohr theory); i.e. we have as the unit of velocity 

= e^lh == c/137-0 = 2-188 x W cm./sec. 

The unit of time is then chosen to be 

<0 = tJvq = 2-4188 X 10“^^ sec. 

In terms of atomic units the SchrOdinger equation reduces to the simple 

■ V^+2(W-V)^ = 0 . ( 1 ) 


27.2. The wave functions of many-electron atoms 
Employing atomic units we may express the Schrodinger equation 
for a neutral atom containing Z electrons in the form 


(iJv/+(ir-F))^ = o, 

where V| denotes the Laplacian operator 


02 02 02 


( 2 ) 

(3) 


(Xj,ypZj) being the rectangular Cartesian coordinates of the jth electron 
referred to the nucleus as origin. For the potential energydn the same 
units we have 

(4) 


z n z j 

''“-2r+2r 


where denotes the distance of the jth electron from the nucleus and 
the distance between the jth and kth electrons. The Schrodinger 
equation (2) can seldom be solved exactly, but by making certain 
assumptions about the nature of the potential energy V approximate 
solutions of equation (2) may be obtained. Of these assumptions the 
simplest is that it is possible to construct a potential function of the 
type ' z 

F=^2F(r^), (6) 

which is a good approximation to the actual potential function V of 
equation (4). Tins is equivalent to considering the mutual repulsions 
of the dleotrons to be averaged out in such a way that each electron 
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moves in a static field which is independent of the positions of the other 
electrons. Substituting from equation (5) into equation (2), we obtain 

im-v(T0+w^^o. (6) 

An equation of this type may be solved by setting for ijj a product of 

n wave functions, 0(r^.), each of which is a function of the coordinates 
of one electron only. Thus if is a solution of the wave equation 

m+{W^-VirMl{rl) = 0, (7) 

and if ^^(rg), are defined similarly, it can readily be 

shown that the product 

is a solution of the Schrttdinger equation (6) corresponding to the energy 
W = W,+W,+...+Wz. 

If we now introduce the electron spin, we may associate either of 
the two spin wave-functions spatial wave- 

functionS ^(r^) so that the spin is considered as doubling the degeneracy. 
To denote that the wave functions are dependent on the four variables 
X, y, 2 , a for each electron we introduce a vector to denote the four 
quantities Tp op The wave functions corresponding to the jth electron 
are then written symbolically 

m) = ( 8 ) 

Thus if we include the electron spin there will be at least two (and, in 
the case df degenerate functions several) wave functions ^ corre- 
sponding to each of the eigenvalues Wg,..., If*,... of the parameter W 
of equation (7). The ^ wave functions corresponding to the eigenvalue 
Wf^ of the energy may be denoted by where the parameter fi ranges 
over the number of values appropriate to the degree of the degeneracy. 
A solution of the wave equation (6) is obtained by choosing any Z of 
the functions substituting in them the coordinates of the electrons, 
and forming the product. Thus, if from the class of functions we 
select a finite set which we shall denote simply by 

then defined by 

= ^i(Qi)^2(Q2)—^z(^z) (^) 

. is a solution of the wave equation (6) corresponding to the energy 

W^W,+W^+...+W^. (10) 

K 


6072 
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There are, of course, several ways in which the energy W can be 
distributed among the electrons, for instance, the wave-functions 

and ' = Ufli)<l>Mt)"4z{flz) 

are quite distinct, although they correspond to the same value of the 
energy. In fact it is easily seen that there are Zl distinct wave functions 
corresponding to the same value of the energy. If P denotes any 
permutation of the suffixes 1, 2,..., A;,..., Z, then the wave functions 
of such a set may be written symbolically as 

Furthermore, if ap is any constant, then the wave function 

O = ^ ^ > 

where the summation is taken over the Z\ different permutations P, is 
also a solution of the wave equation (6) for the same value of the 
energy W, Taking ap to be + 1 for even permutations and — 1 for odd 
permutations we obtain the solution, anti-symmetrical in the g’s, 


^i(qi) 

^2(^1) 


^i(qa) 


• • <t>zi<li) 

<l>Mz) 

^a(qz) • 

* • 


This is the form that has to be used in the solution of problems involv- 
ing atoms and molecules. The interchange of two electrons leaves 00* 
unaltered, as it must since it is impossible for us to distinguish between 
two electrons, and O vanishes, in accordance with the Pauli exclusion 
principle, if two of the electrons have their q^s equal. f A wave function 
Witten in this way to include space and spin coordinates alike has 
many advantages over wave functions constructed in the space coordi- 
nates and the spin coordinates separately, especially in that it avoids 
many applications of the theory of groups which were a necessary 
consequence of this method of attack. 

Some authors write the determinant (11) in the form 
O = Det{^i(qi)^2(q2)...^2(q^)}, 
exhibiting only its leading term. 

There may be several wave functions of the type (11) with the same 

t P. A. M. Dirac, Proe. Rog. Sof. A, 112, 661 (1626). See also J. C. Slater, Phya'Rev. 
34, 1293 (1629). 
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energy. For example, the excited states of a two-electron atom may 
be described by wave functions of the type 

^mWXa(<'2) 

^ ^ 'l’mih)xM >l>n(Xl)Xfii<^l) 

'l>mi^2)xJ<’2) fAn(r2)Xj3K) 

^ ^ !^m(ri)X|3K) <l>n(.^l)xM 

each corresponding to the energy 

As long as we assume the potential to be of the type (5) any linear 
combination of the O’s will be a solution of the SchrGdinger equation. 
But if we use the correct form (4) the degeneracy will disappear either 
wholly or in part, and a first order approximation to the solution wall 
be given by particular linear combinations of the O^s. In the case of 
the helium atom, for instance, the approximate wave functions are 
or any linear combination of <^i, O 4 , and Og+Og. 

27.3. The electron distribution function of a many-electron system 
In the non-degenerate case the approximate solution of the Schr 6 - 
dinger equation is a single determinant. In the degenerate case the 
approximate solutions are a restricted set of linear combinations of 
determinants. Even in this case, however, it is usually possible to 
choose some of the solutions to be single determinants. In this section 
therefore we shall consider the electron distribution when the solution 
is a single determinant. 

If the wave function 0 of a many-electron system is expressed as 
a determinant of the form ( 11 ) and, as is consistent with the results 
of §27.2, the functions ^ are orthogonal to each other and normalized 
to unity, then the probability of finding one electron in the volunae 
element another one in the volume element and so on, is 
pdgidga-.dgjj. 

Here dq = drda includes both space and spin coordinates and 




• mi) 


^i(qi) • 

• ^z(Qi) 

o*® = 

^f(q2) • • 

■ mz) 

X 

^1(^2) 



0?(qz) • 
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are quite distinct, although they correspond to the same value of the 
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* • 


This is the form that has to be used in the solution of problems involv- 
ing atoms and molecules. The interchange of two electrons leaves 00* 
unaltered, as it must since it is impossible for us to distinguish between 
two electrons, and O vanishes, in accordance with the Pauli exclusion 
principle, if two of the electrons have their q^s equal. f A wave function 
Witten in this way to include space and spin coordinates alike has 
many advantages over wave functions constructed in the space coordi- 
nates and the spin coordinates separately, especially in that it avoids 
many applications of the theory of groups which were a necessary 
consequence of this method of attack. 

Some authors write the determinant (11) in the form 
O = Det{^i(qi)^2(q2)...^2(q^)}, 
exhibiting only its leading term. 

There may be several wave functions of the type (11) with the same 

t P. A. M. Dirac, Proe. Rog. Sof. A, 112, 661 (1626). See also J. C. Slater, Phya'Rev. 
34, 1293 (1629). 
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from the determinant. Carrying out the integrations with respect to 
the coordinates of the other electrons we obtain the relations 

J P ~ ^Pz-2i 


j p dqs‘“^z — (^^^~\~^V-P8-v 


j pdqi...dqz = (Z-l)lpii. 

As previously explained, the probability of finding any one of the 
electrons in the volume element dr^ is given by the sum of the distri- 
bution functions corresponding to the one-body problem, i.e. 

27 A. The energy of a many-electron system 

The Schrttdinger equation for a system of Z electrons moving in the 
field of a nucleus of atomic number Z may be written in the form 

(J[/-PF)O = 0, (16) 

where, in atomic units, 



and (18) 

If we multiply both sides of equation (16) by O*, substitute for <I) from 
equation (11), and integrate over the whole of the configuration space 
(i.e. over spin as well as spatial coordinates), we obtain the relation 

W ( pdT=yf Hjp&r+y ( ^dr, (19) 

•’ i ^ 

where p is defined by equations (12) and (13). It should bo noted that 
in this equation the operator must be taken to act on py but not 
on p^i\ this is a consequence of the fact that H^p is formed from the 
product in which operates on 0 but not on 0*. The integra- 

tions can be performed by methods similar to those outlined in the 
preceding section. For example, we have 

dqjdq^. (20) 

PJc) Pkk 
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From the definition of the quantities we see that the right-hand side 
of this equation is equivalent to 

- 2 2 f (21) 

Since only certain of the orbital wave-functions (l>x will have the same 
spin factors, they alone will give a non-zero result when the integrations 
over the spin variables are performed; the remainder are orthogonal 
and the integrals involving them vanish. Thus, if the spin wave func- 
tions are normalized to unity and if we write for the integrals over the 
spatial coordinates 

(ah\0\cd) = f (22) 

we find that the first term of (21) may be written 

y (A/iiGM. 

In the second term of (21), on the other hand, certain of the terms will 
vanish; it reduces to ^ 

8 iAWW), 

where the summation symbol 8 implies that the sum is taken over 

A.fi 

only those pairs of numbers A, p, for which the corresponding orbital 
T^ve functions <l>x and have the same spin factor. Furthermore, 
since there are JZ(Z— 1) lines passing through Z points we have that 


so that finally 


2 r^=iz(z-i) 

^ik j fjk 


2 J ^ ^ J (V|G|A/^)- « (f^lGM). (23) 

The integrals {ah\G\cd) occurring in equation (23) and defined by 
equation (22) are Imown as generalized exchange integrals. 

The integrals involving p and Hj only may be evaluated in a similar 
fashion; we obtafti r 


^jH^pdq = Z\^WHWj' 
(Ai^riA) = f 
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dTj denoting as usual the volume element of the spatial coordinates. 
Substituting from (23) and (24) into equation (19) and dividing through- 
out by Z! we obtain finally for the total energy of the system 

W =J^(A|fllA)+j(g(V|GlV)- (26) 

where the summation 8 is taken over those pairs of spatial wave 

functions which have the same spin factors in 0. If can be 

calculated from equation (26) only if the orbital wave functions of the 
system are known. Equation (26) shows, however, that the energy of 
a many-electron atom is made up of three components: 

(i) the sum of the energies of each electron in the presence of the 
nucleus alone; 

(ii) the sum of the Coulomb interactions of electrons in one orbital 
with those in another; 

(iii) the sum of the exchange interactions between two electmiis in 
different orbitals but possessing the same spin. 

28. Variational methods of determining wave functions and 
energy values of many-electron atoms 
28.1. Fock's variational equations 

The SchrOdinger wave equation (16) may easily be shown to be 
equivalent to the variation principle 

8 JO*(Zr-TF)O(ig = 0, (27) 

8 denoting an arbitrary change of O subject to the normalizing condition. 
The integral is taken throughout the configuration space of all the 
coordinates (including those describing the spin) of all the electrons. 
By the methods of the calculus of variations we can establish that 
equation (27) reduces to 

|8O*(^-Tr)«D(ig = 0, 

which yields the variational equation 

W J dq = j 80*£r0 dq. (28) 

In the many-electron problem 0 is a determinant of the form (11), 
so that equation (28) takes the form 

= ZlWljm^dr, (29) 
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where p is defined by equation (13). As before, the quantity in 
equation (29) operates on the coordinates of the jth electron when they 
appear in terms such as but not when they occur in terms of the 
type pji. Contrariwise, 8^- operates only on p^j and not on With 
this convention we have 


= 


Pll 

Pl2 

• • ^jPlj 

• PlZ 

P21 

P22 

• . HjPij 

• P2Z 

^iPil 

^iPi2 • 

• • ^Hjpij . . 

• hpiz 

Pzi 

PZ2 

^iPZj 

Pzz 


and performing the integrations as in § 27.3 we find that if i ^ j 

J dr = (Z-2)\ J I I dq,dq„ 

which may be reduced to the form 

J h^H^p dr 

= (Z-2)!( I (mlHlm) J Ht{Ti)Uu)dr,- 8 (m\H\p) f 

Similarly | htHtp dq = {Z—\)\ 2 J 8^*(r,)£l^(r<) dr,, 

SO that finally 

= Z! 2 J 8 {m\H\p)^„+ 2 dr, (30) 

p»l«' ' m m=l * 

(P) 

where the summation S extends over all those 0^s which have the 

m 

same spin factor as in the expression for 0. 

By a method similar to that employed in the last section for the evalua- 
C dr 

tion of the integral 8^ J we can establish that, for % 


/(?MgrD 


pdq 


-(Z!)( 2 8 (lml(?M)} f (31) 
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the summation 8 being taken over those sets of spatial wave functions 

l,m,n 

tjf which have the same spin factor in 0. The result for i = j or i — k 
can be derived in a similar manner; it is found to be 


(32) 

where G„„{t) = f 0*(r')^„(r')-i-^ ' 

J Ir -r| . (33) 

and V(r) = 2 <?„„(r) 

n 

Substituting from equations (26), (30), (31), and (32) into the variational 
equation (29) we obtain series of terms of the type fhifj*. Since we 
may treat the variations as independent, we may take / = 0; in 
this way we obtain a set of equations of the type 

+ ^S(lm\G\'d)^„ = 0, (34) 

l,m 


where denotes the number of times, 1 or 2, occurs in the wave- 
function 0. The system of linear equations (34) for the, determination 
of the functions were first derived by Fockf by a different method; 
the derivation outlined above was given subsequently by Lennard- 
Jones.J 

(n) 

Simple expressions for the summation symbols 8 can be derived 
in certain special cases. For example, if the atom consists of p pairs of 
electrons and q—p outer electrons possessing the same spin but different 
orbital quantum numbers, then when n refers to one of the orbitals in 
a closed shell {n < p) the wave function may be associated with either 
of the spin wave-functions x®, XjS- ^^en, = 2 and 


in) P Q 
^ = 2 + 2 , 


'S-ii + i i. 

l,m Z«=lm=l i=l m=l 


(36) 


On the other hand, if ^ < ti < g, = 1 and 



l,m 1=1 m=l 


(36) 


By this method we have reduced the problem of solving the Schrb- 
dinger equation in 3n independent variables to that of determining the 
solid'ions of a system of n partial differential equations each with three 


t V. Fock, ZeUa,f. Phyaik, 61, 126 (1930). 
t J. E. Lennard* Jones, Proc. Comb, Phil, Soc. 27, 460 (1930-1). 
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independent variables. The equations are not independent, but they 
lend themselves readily to solution by successive approximations. 


28.2. Hartree's method of the self -consistent field 

The simple method of calculating the field of an atom devised by 
D. R. Hartree and discussed in the last chapter for two-electron atoms 
may readily be extended to atoms containing many electrons. Each 
electron is assumed to possess a wave function which obeys a Schr5- 
dinger equation. The potential energy appearing in this equation is 
taken to be partly that due to the field of the nucleus and partly to 
the fields of the electrons in the other orbits. The latter fields are 
calculated by representing each electron by a uniform charge distribu- 
tion of density — in accordance with the SchrCdinger interpreta- 
tion of the meaning of the wave function The tth electron thus 
moves in a field in which its potential energy is 

F,(r) = 2 f (37) 

3^i ' 


where we have considered a system of Z orbital electrons moving in 
the field of a nucleus of atomic number Z, The wave function of 
this electron then satisfies the SchrOdinger equation 

[V2+2(ir-JJ)]^, 0 (38) 


with.Vt(,r) g^venby equation (37). 

In applying this method to atoms containing many electrons Hartree 
has neglected the ‘exchange’ forces. Indeed Hartree ’s equations may 
be derived from the system of equations (34) by neglecting 

(i) the coefficients of every for which m :^n; 

(ii) the non-diagonal exchange integrals {ln\G\nl), 

Pock’s equations (34) then reduce to 

[H+ I' = ((»|fr|w)+ 1' (lm\Q\lm)]UT), (39) 

the dash denoting that the term Z = n is omitted in the summation. 
It will be seen that (n\H\n) is the energy of the nih. electron in the 
field of the nucleus alone, {ln\Q\ln) is the averaged electrostatic 

energy due to the interactions between all the other electrons, and 
Gji(r) is the potential of the nth electron due to the Coulomb fields 

of all the other electrons averaged over their respective probability 
ohaige distributions. These equations may be solved numerically 
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of the spin wave-functions x®, XjS- ^^en, = 2 and 


in) P Q 
^ = 2 + 2 , 
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l,m Z«=lm=l i=l m=l 


(36) 


On the other hand, if ^ < ti < g, = 1 and 



l,m 1=1 m=l 


(36) 


By this method we have reduced the problem of solving the Schrb- 
dinger equation in 3n independent variables to that of determining the 
solid'ions of a system of n partial differential equations each with three 


t V. Fock, ZeUa,f. Phyaik, 61, 126 (1930). 
t J. E. Lennard* Jones, Proc. Comb, Phil, Soc. 27, 460 (1930-1). 
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and a different scale of r is used for r < 1 and r > 1. This is usual 
in diagrams of this type since a scale ojien enough to show the detail 
of the curves for r < 1 is unnecessarily open for r > 1 and the use of 
two uniform scales has seemed preferable to the distortion produced 
by a logarithmic scale. The regions of maximum charge corresponding 
to the L, and M shells are clearly evident on this diagram. There 
is, however, no peak corresponding to the N shell — only a flattening 
out of the curve; this is due to the fact that the contribution from the 
if -electrons is still appreciable in this region. 



The same features are evident in the curve (Fig. 20) showing the 
variation of the radial charge density for the chlorine ion Cl~. For this 
ioniboth Hartree’s and Fock’s methods have been usedf and the curves 
corresponding to both sets of calculations are shown. It will be noted 
from this diagram that the two curves have approximately the same 
sh|.pe but that for large values of r they differ appreciably. Indeed, 
if we denote by hV(f) the difference between the radial charge densities 
in the two cases, then it can be shown that in the units chosen 

Jr2 8{7(r)rfr = 13-8, 

. 0 

and further that about 95 per cent, of this value comes from values 
of *v greater than 3*4 atomic units, which is the ‘ionic radius’} of Cl". 
It would "thus be expected that the distortion of the wave functions by 
the presence of neighbouring atoms will be considerable in this region. 

t W. R. Hartree, Proc. Boy. Soc. A, 156, 46 (1936). 

i W. L. Bra^, The CryskUline State, chap, vii, p. 116 (1934). 
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If we calculate the diamagnetic susceptibility, from these curves 
we find by the Hartree method (Hartree and Hartree, loc. cit) that 

^41-3xl0-«, 

and by the Fock method that 

^ = -30*4x10-8. 

The observed valuef is — 25x 10-®, so that the use of the solutions of 
Fock’s equations has reduced the discrepancy between the calculated 
and observed valuer of x to about a third of the amount of the dis- 
crepancy when the results of the self-consistent field are used. The 
improvement which the solutions of the Fock equations makes to the 
calculated value of the electrical polarizability is also considerable. It 
seems then that the improvements effected by the use of the Fock 
equations — i.e. by the addition of ‘exchange’ terms to the Hartree 
equations — are substantial. 

Solutions of the Hartree and of the Fock equations for other atoms 
and ions have been obtained by several authors. The results are given 
in the papers cited below. No attempt has been made to divide these 
papers into two classes depending on whether the Hartree or Fock 
equations formed the starting-point of the investigation, since it not 
infrequently happens that in the same paper the solutions of both types 
of equations have been derived for purposes of comparison and also 
for determining the relative speed of convergence of both methods. 

D. R. Hartree Proc. Camb. Phil. Soc. 24, 111 Li, Rb+, Na+, Cl“. 

(1928) 

Proc. Roy. Soc. A, 141, 283 (1933) Cl", Cu+. 

143, 606 (1933) Cu+, K+ Cs. 

„ „ 150,96(1936) Al+8, Fe-, Rb+. 

Phys. Rev. (2), 46, 738 (1934) Hg. 

D. R. Hartree and Proc. Roy. Soc. A, 139, 311 (1933) 0 in various stages 
M. M. Black of ionization. 

D. R. and W. Hartree Proc. Roy. Soc. A, 149, 210 (1936) Be, Ca, Hg. 

150,9(1936) Be. ' 

„ „ 156, 46 (1936) Cl". 

„ „ 157,490(1936) Cu+ 

„ „ 164, 167 (1938) Ca++. 

„ „ 166,460(1938) K+ A. 

Proc, Camb. Phil. Soc, 34, 660 Na-, K“. 

(1938) 

J. MacDohgall Proc. Roy, Soc, A, 138, 660 (1932) Si+*. " 

E. H. Kennard and E. Phya, Rev. (2), 46, 1034 (1934) Na. i*- 
Ramberg 

t G. W. Brindley and F. F. H’oare, Proc. Roy. Soc. A, 152, 342 (1936); E. C. Stonor, 
Maiptetism and Atomic Structure, chap, ix, p. 4. 
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C. C. Torrance 

Phya, Rev, (2), 46, 388 (1934) 

C. 

L. P. Smith 

Phya. Rev, (2), 42, 176 (1932) 

He. 

V. Fock and M. J. 

Phya, Zeita, d, Sowjet, 6, 368, 

Na. 

Petrashen 

(1934) 



Phya, Zeita, d, Sowjei, 8, 647 

Li. 

M. M. Black 

(1936) 

Mem, Mancheater Phil, Soc, 79, 

Ag+. 


29 (1934-6) 

Mem, Mancheater Phil, Soc. 79, 

Cr. 

M. F. Manning and J. 

76 (1934-6) 

Phya. Rev, (2), 49, 849 (1936) 

W 

Millman 

M. F. Manning and 

Phya. Rev. (2), 53, 662 (1938) 

Fo. 

Goldberg 

W. ^ Wilson and 

Phyt. Rem. (2), 47, 681 (1936) 

He (excited state), 

R, B. Lindsay 

H. L. Donley 

Phya. Rev, (2), 50, 1012 (1936) 

Si+2, Si+® 

R. L. Mooney 

Phya, Rev, (2), 55, 667 (1939) 

Cr++. 

W. Thatcher 

Proc. Roy. Soc, A, 172, 242 (1939) 

K+\ K+. 

D. R. and W. R. Har- 

Phil. Trana, Roy. Soc. A, 238, 229 

0 

tree and B. Swirles 

(1939) 


A. F. Stevenson 

Phya. Rev. (2), 56, 686 (1939) 

He, Om. 

A. Juoys 

Proc, Roy. Soc. A, 173, 69 (1939) 

c+s C++. 

A. Q. Williams 

Phya. Rev, (2), 58, 723 (1940) 

0u+. 

D. RandW. R. Har- 

Phya. Rev, (2), 59, 229 (1941) 

Ge++, Ge 

tree and M. F. Man- 

„ „ 60, 867 (1941) 

Si IV, SiV. 

ning 

M. T. Antunes 

Phya. Rev. (2), 62, 362 (1942) 

Co. 

D. R. Bates and 

Phil. Trana, Roy. Soc. A, 239, 269 

O'", Oa". 

H. S. W. Massey 

(1943) 



28.4. Amlytical atomic wave functions 

One of the simplest methods of determining the wave functions and 
energies of atoms which possess only a few electrons consists in con- 
structing a wave-function 0 which has the correct symmetry properties 
bujj which contains certain parameters a, j8, y,... which are left un- 
specified. The most favourable values of these parameters are then 
determined by the variation method as follows. The function is substi- 
tuted in the equation 






(40) 


and the values of the parameters determined so as to make the eimgy 
W a mihimum. In this way we obtain the system of equations 


doc dp dy 
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for the determination of the parameters a, y,.... Wave functions 
constructed by this method have proved to be of great value in the 
approximate calculation of the properties of atoms containing many 
electrons. 

This method was used by HyUeraasf to give a very good approxima- 
tion to the observed ionization potential of helium. The form of wave 
function assumed by Hylleraas was 

corresponding to a helium atom in its ground state. 

The wave functions for the atoms li, Be, B, C, N, 0, F, Ne have 
been derived by this method by Zener and Guillemin.t For the 15- 
orbitals Zener assumes a wave function of the form 



0 = 

(42) 

for the 25-orbitals 

0 = 5rj(r— 

(43) 

and for the 2p-orbitals 


(44) 


Here <f>) denotes the surface spherical harmonic of the first degree, 

-^(3/87r)sin0e*^^. The function (42) is of the same type as the wave 

function for the hydrogen atom in the ground state; the parameter y 

may be regarded as allowing for the screening effect of the other 

electrons round the nucleus. For each atom the four parameters ri*, 

a, y, 8 are determined from the equations (40) and (41). The parameter 

n* has been varied only in the cases of Li and Be; since for both of 

these atoms it has the hydrogenic value 2*0, Zener assumed it to retain 

this value in the other cases considered. It is found that for an atom 

of atomic number Z the parameter y is given approximately by the 

formula ,z & un\ 

y = (46) 


The values of the remaining parameters are given in the accompany- 
ing table; by the use of these values the energies of the L-shell were 
determined with an error of one or two electron volts. 


Table 1 



Li 

Be 

B 

C 

N 

0 

P 

Ne 

«(±0-02) 

0-18 

016 



0-06 

0-06 

0-06 

0-06 

8(±0-01) 

0-63 


1-26 

1-59 

1-92 

2-24 

2^66 

2*88 


t E. Hylleraas, Zeits. f. Phyaik, 65, 209 (1930). 

i v! Guillemin and C. Zener, ibid. 61, 199 (1930); C. Zener, Phya, Eev. (2) 36, 51 
(1930). 
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Simple rules for obtaining the values of the four parameters y, a, n*, 8 
for most of the atoms in the periodic table have beeA given by Slater.f 
The charge densities derived from the functions (42), (43), and (44) with 
Slater’s values for the parameters agree fairly well with those found 
by Hartree’s method of the self-consistent field in the cases where that 
method has been applied. 

For some purposes there is not sufficient fiexibility in the Zener- 
Slater wave functions, and, in addition, the separate orbitals are not 
all mutually orthogonal. The method has been improved by Morse, 
Young, and HaurwitzJ and by Duncanson and Coulson§ to remove 
these defects. These authors take the wave function of an atom to be 
the appropriate sum of determinantal wave functions in which the 
wave functions of the component orbits are assumed to be 

^(l8) = 
i/r(2«) = 

0(2po) = I iT)r COB 6 

^(^P±) = >/(/^®c^/27r)rsin0e-/^‘^±^^. 

The numbers and Yg are numerical constants; is determined by 
the condition that the wave-functions ^(28), ipils) are orthogonal and 
Ni is a normalizing constant. The energy is minimized to determine 
the remaining four constants a, c, fi; if we are concerned solely with 
atoms in their ground states an absolute minimum of the energy exists. 
The results obtained by Coulson and Duncanson for the ground states 
of the atoms Li to Ne are shown below. The quantity 2/x may be 

Table 2 


Element 

ay, 

by 

C/i 


Slater's 
value of y 

Li 

2'69 

2-26 


0-858 

0-650 

Be 

^3-69 

3-28 

, , 

0-979 

0-975 

B 

4-69 

4-20 

1-20 , 

1-822 

1-300 

C 

6’69 

613 

1-56 

1-652 

1-625 

N 

6-68 

601 

1-91 

1-970 

1-950 

^ 0 

7-68 1 

l oa . ( 

2-22 

2-289^ 

2-275 

F 

8-67 

- 8 02 

2-65 

2-611 

2-600 

Ne' 

9-66 

'8-97 

2-88 

2-934 

2-926 


regarded as the effective nuclear cha^^ for the 28 - 9 lectron at large 
t J. C. Slater, Phy€. Rev, (2), 36, 5f (1930). 

X P. M. Morse, L. A. Young, and E. S.^urwitz, ibid. 48 , 948 (1935). 

§ W. E. Duncanson and C. A. Coulson, Proc. Roy, Soc, Edin, 62, 37 (1944). 
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distances, and is seen to increase by about 0*35 for each extra electron. 

The formula " 1//7 iv 

II = i(Z-l) 

gives II. in terms of the atomic number Z to an accuracy of about 2 per 
cent. 

29. The perturbation method of determining approximate wave 
functions of atornsj 

29.1. Non-degenerate cases 

In addition to the equations of Fock and Hartree, other approximate 
methods of determining the wave functions and energy levels of many- 
electron atoms have been devised. Suppose, for example, that the 
SchrOdinger equation (in atomic units) for an atom containing many 
electrons is written in the form 

= (46) 

where H denotes the Hamiltonian 

(47) 

Z being the number of electrons. If the equation (46) cannot be solved 
directly, it is often possible to obtain an approximate solution from the 
solution of an equation which is separable and which does not differ 
greatly from (46). Suppose that the wave functions and characteristic 
values of the Schr 6 dinger equation 

(H'-lf)O = 0 (48) 

z z 

are known, where = — J T V|+ T F'-. (49) 

jZi ^ jti ^ 

Here each of the functions F^ is a spherically symmetrical function 
of r^. If a non-degenerate solution of the equation 

( 60 ) 

dan be obtained, the solution of equation (48) con be written as 
determinant whose elements are solutions of equation (50); in most 
physical problems there is thus more than one wave function 0 ;^, for 
.evfry characteristic value Wj^ of the wave equation (48). For simplicity 
we shall, however, assume, in the first instance, that there is only one 
solution Ojj, to each value of WJ. and consider the degenerate case later. 
In the non-degenerate case thb functions then form an orthogonal 
set. Equation (48) may still have non-degenerate solutions even if the 

t J. E. Lennard-Jones, Proc. Roy. jSoc. A, 129, 698 (1930) ; Joum, Land. Math. Soc. 6, 
290 (1931), 

Wt% > 


L 
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solutions of (50) are degenerate. Erovided the only degenerate solutions 
of (50) used in constructing O are complete sets, then 0 will be non- 
degenerate; e.g. for the helium atom in the ground state we construct 
<I> from the two l5-wave functions, and there is no other way of con- 
structing 0 to give the same energy. Moreover, provided the solutions 
of (50) are chosen to be orthogonal, as they always can be, the functions 
0 will be all orthogonal whether they a»e degenerate or not. 

Let us write J 0*0; dr = (A;|l|^), (51) 

♦ 

where the integral is taken over all the coordinates, spatial and spin, 
then if the wave functions O^j. are normalized to unity (k\l\l) is merely 
the Kronecker deltaj and (A;|1|Z) always vanishes for h 

To obtain the approximate solution of equation (46) we observe that 

z 

we have supposed equation (48) to be such that ^ F'* does not differ 

3=1 

greatly from F ; we may thus write equation (46) in the form 


(ff'-lF)fl>= - dO, (62) 

where v= V— ^V'j (63) 

is small. It is assumed that the solution of equation (52) can be ex- 
pressed as a linear combination 

^ ^ 

of the solutions of equation (48). Substituting this series in equation 
(52), multiplying by O*, and integrating over the whole range of all 
the variables specifying the wave functions, we obtain a system of 
linear algebraic equations 

|a;,{(llif'|fc)-lF(llll*)+(l|t;lfc)} = 0 (1=1,2,.,.) (64) 


for the determination of the coefficients and the possible values of W, 
In these equations we have written, in the usual notation for matrix 
elements, 

(1141fc) = J,«I)?4<D*(lj. . 

It follows immediately from this definition and from that of v 

{l\H'\k)+(l\v\h) = (llfllfc). 


which is the matrix element of the Hamiltonian of the equation (46) 
with respect to the wave functions of the SchrOdinger equation (48). 


t The Kronecker delta is defined by the relations ^ 0 (fc 9 ^ 1), ~ 1 . 
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The system of algebraic equations (54) may then be written in the form 

= ■ ( 66 ) 

In the general case the wave-function 0 is an infinite sum of terms of 
the form so that, in general, the system (65) consists of an infinite 
number of equations in an infinite number of unknowns. Eliminating 
the coefficients from the set of equations (56) we obtain the determi- 
nantal equation , 

(ll^ll)-Tf(lllll) .(11^12) (im) . . . 

(2\H\l) (21^(12)- 1F(2|1|2) {2\H\Z) • • -^0 

(31^11) (31^12) (3|£fl3)-F(31113) . . . 


In the case where the wave-functions are normalized to unity, 

(im = 8 „; 

also (l\H'\k) = 

where Wj^ are the eigenvalues of equation (48). The determinantal 
equation then reduces to 


1— • 

1 

(11«|2) 

(lb|3) . . . 


(21»11) 

(2|*;12)-(1F-Wi) 

(2|t>|3) ■ . . . 

= 0. 

(3|«11) 

(zm 

(3|t,|3)-(ir-ir3) . . . 





(66) 


Since the number of equations of the system (56) is infinite, the determi- 
nant on the left-hand side has an infinite number of rows and columns. 
The eigenvalues are known, and once the functions are determined 
the matrix elements (l\v\k) can be determined by quadratures, and the 
values of W found. Suppose, for definiteness, we wish to determine the 
change in the value of W^. As a first approximation we assume 

mic) = 0, 

except when 2 = 1, A; = 1, so that 

If = Hi-f(l|t;ll). 

The second approximation is obtpned by assuming (l\v\k) to be zero 
except when either lor k is 1. Then t 

(i|i,li)-(ir-iri) (ii«i2) (ii«i3) . . . 

( 2 H 1 ) w,-w - 0 . . . 

(3|»|1) 0 Wi-W . . . ’ 
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whence we obtain the equation 

If = (57) 

for the determination of W. More accurate solutions are obtained 
by including more of the elements (l\v\k) in the expansion of the 
determinant. 

When the energy values have been determined by some numerical 
method, the corresponding wave-functions 

are obtained from the solution of the set of equations (56). The wave 
function can thus be written as a determinant of the type 




(2|«|1) (2|t)|2)-(If-lf2) . 

. . (2\v\m) 

(m\v\l) {m\v\2) 

. . . . . 


29.2. Degenerate cases 

We now consider the case, which occurs most frequently in practice, 
in which there is more than one wave function <!)* corresponding to 

each value of the energy Suppose 
there are a such functions associated with 
the same eigenvalue WJ. of the energy; 
denote these wave functions by 

O'fc n- (68) 

If we denote by Aj^ the square of a* ele- 
ments corresponding to the set of wave 
functions (68), then the determinantal 
equation (56) assumes the form shown in 
Fig. 21, where etc., are the square 
arrays corresponding in a similar way to the energies HJ+i, etc. It should 
of course be noted that the elements of the determinant outside the 
shaded squares do not in general vanish. 

The determinant A^ can be transformed to one in which all the 
elements are zero except those on the diagonal by a method siihilar 
to that of changing the axes of a quadric in higher space. Since the 



Fio. 21. 
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wave functions are degenerate it follows that the 

wave function « 

(59) 

i=l 

formed from a linear combination of the functions 0|. will also corre- 
spond to the energy value WJ.. The function (59) will be orthogonal to 
a similarly constructed wave-function 

( 89 ) 

i=l 

provided that, when g is not equal to A, 

Thus if we can determine a set of coefficients such that 

2 PihPig = 

the new set of wave functions ^ form an orthogonal set. With the 
set of wave functions so constructed the determinant Aj. becomes a 
diagonal one. Similar transformations can be carried out for the square 
matrices giving finally an equation of the type of Fig. 21 but 

now with the matrix given by 



0 . . 

0 N 


(2\v\2Y-(W-W,). . 



0 . . 

Mv\ocy-{w-wJ 


the symbols (g\v\hY denoting the matrix elements 

(sfi«i^)' = J 

and -djfc+i... of similar type. 

The transformed determinantal equation may then be solved approxi- 
mately by neglecting all the elements except those on the leading 
diagonal and successive approximations obtained by a variety of 
methods depending on the relative magnitudes of the elements neglected 
in the first step. Thus as a first approximation for the change in JfJ we 

obtain W = W,+ jn*v9i^dq, 

where g — I, 2,..., a; as a second approximation we may omit every 
(k\v\l) except those within the squares Aj^^ Ajg+iy,.,, and those of one 
row and column which intersect on the diagonal, and so on as in the 
case of non-degeneracy. 
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whence we obtain the equation 

If = (57) 

for the determination of W. More accurate solutions are obtained 
by including more of the elements (l\v\k) in the expansion of the 
determinant. 

When the energy values have been determined by some numerical 
method, the corresponding wave-functions 

are obtained from the solution of the set of equations (56). The wave 
function can thus be written as a determinant of the type 




(2|«|1) (2|t)|2)-(If-lf2) . 

. . (2\v\m) 

(m\v\l) {m\v\2) 

. . . . . 


29.2. Degenerate cases 

We now consider the case, which occurs most frequently in practice, 
in which there is more than one wave function <!)* corresponding to 

each value of the energy Suppose 
there are a such functions associated with 
the same eigenvalue WJ. of the energy; 
denote these wave functions by 

O'fc n- (68) 

If we denote by Aj^ the square of a* ele- 
ments corresponding to the set of wave 
functions (68), then the determinantal 
equation (56) assumes the form shown in 
Fig. 21, where etc., are the square 
arrays corresponding in a similar way to the energies HJ+i, etc. It should 
of course be noted that the elements of the determinant outside the 
shaded squares do not in general vanish. 

The determinant A^ can be transformed to one in which all the 
elements are zero except those on the diagonal by a method siihilar 
to that of changing the axes of a quadric in higher space. Since the 
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with respect to these wave functions and the determinantal equation 
constructed in the manner described above. The fifteen degenerate wave 
functions lead to a square matrix of fifteen rows and columns, and the 
first step in the calculations is to determine such linear orthogonal 
combinations of the wave functions as will transform this square matrix 
into one with diagonal elements. This procedure is repeated for the 
other square matrices and the roots of the determinantal equation 
determined by the method of successive approximations outlined above. 

This cumbersome procedure can, however, be avoided by a method 
due to J. C. Slater based on the above analysis; this will be described 
in the next section. 

29.3. Slater's method for complex atoms^ 

Slater’s method of determining the values of the energy levels of 
complex atoms is based on the fact that the wave function (61) repre- 
sents, not only a state of definite energy, but also one with definite 
angular momentum. The angular momentum about the z-axis 6 = 0 
of a single electron defined by the wave function (61) is m measured 
in atomic units (mh in ordinary units). In the carbon atom we suppose 
as a first approximation that the six electrons move independently in 
the same central field and that the angular momentum of each electron 
is given by the value of m in the appropriate wave function. By 
assuming that the six electrons are moving in the same field we obtain 
the total angular momentum (in atomic units) merely by adding the 
m’s appropriate to each of the electrons; thus denoting this quantity 
by Jf, we have = (62) 

The field in which the electrons move is not, of course, spherically 
symmetrical. The electrons exert forces on one another and it is no 
longer possible to determine the angular momentum of the Mividml 
electrons. These forces cannot, however, affect the total angular momen- 
tum; the value of given by equation (62) therefore remains unaltered 
despite the interactions of the electrons. 

, In a similar way the total angular momentum due to the spins of the 
electrons is given by the formula 

‘ (63) 

where s denotes the angular momentum in the z-direction due to the 
spin of a single electron moving in a spherically symmetrical field. In 
atomic units s assumes either of the values ± J. 

t J. C. Slater, Phys, Rev, 34, 1283 (1929). 
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The values of Mg and 8g for the fifteen wave functions corresponding 
to the lowest energy of the carbon atom are shown in Fig. 22, the 
numbers of wave functions with the same values of Mg and 8g being 
shown in the circles which are joined to points whose coordinates are 
the appropriate values of 8g and Mg, 

Now the Hamiltonian operator is not changed by a rotation about 
the ^-axis so thatf the angular momentum commutes with the Hamil- 
toniajifl. Thus M,H-HM, = 0. (64) 

A typical matrix element of MgH is by the definition of a matrix 
product \M,H\n) = 2 {m\MMl\H\n) 

= (m.\M^\m){m\H\n), 

sitice Ml, is a diagonal matrix and therefore (m\Mi,\l) = 0 if m ^ 1. 

| {m\H\l)[l\M,\n) 

= (m\H\n)(nmn). 

It follows at once that equation (64) is satisfied only if 

{m\H\n) = 0, 

when {m\Mg\m) # {n\Mg\n), Similarly it can be shown that (m\H\n) = 0 
when (m\8g\in) is not equal to {n\8g\n). Thus the elements of the 
determinant on the left-hand side of equation (58) are zero, except 
when the two wave functions defining the matrix element have the 
same values of the momenta Mg and 8g, 

By means of this result and Kg. 22 we see that the matrix Aq corre- 
sponding to the lowest energy of the electrons reduces to the form 

r • 1 ^ 





( 66 ) 


the dots denoting the 2Q of the 225 matrix elements which are not 
zero; the matrix Aq, thus consists of one square of three rows and 


t See equation (29 a), Chap. XII. 
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columns, two of two rows and columns, and eight diagonal elements. 
The roots of the determinantal equation 

l^ol = 0 (66) 

may, or may not, be distinct. To determine which, if any, of these 
roots are multiple we consider next the total angular momentum M 
and the total spin momentum S, 

Not only the component but also the total spin is a constant 
of the motion. Furthermore, if the total spin is /Sf, 8^ may assume any 
one of the 28-i-l values 




0 

0 ) 




S, 8-1, S-2,..., -[8-2), -(8-1), -S; 
and if we neglect the interaction of the spins with the space coortliiiaies 
and the mutual interactions of the spins, to 
each of these values of 4; there must corre- 
spond an equal value of the energy. Similarly 
the total angular momentum if is a constant 
of the motion and it may be proved that to 
each component of if there is related an 
equal value of the energy. It can also be 
proved that: (1) The eigenvalues of if^ are 
L(L+l) where L is an integer. (2) To each 
eigenvalue of if^ there correspond 2L+1 in- 
dependent wave functions which can be 
chosen. 

The various groups into which the fifteen 
values of (8^, if^) of Fig. 22 fall are shown in 
Table 3. In assigning a name to each of the three groups we have 
followed the language of spectroscopy in which a multiplet is denoted 
by S, P, D,... according as if = 0, 1, 2,..., the multiplicity 2S+1 




0 ) 




Fig. 22. The S^-M^ diagram. 


Table 3 


M 

8 

Values of {8,, M^) 

Name of group 

Energy 

0 

0 

(0, 0) 

^8 multiplet 


1 

1 

i 

(-1,-1), (-1,0), (-1, 1) 
(0,-1), (0,0), (0,1) 

(1,-1), (1. 0), (1,1) 

multiplet 


2 

0 

(0,-2), (0,-1), (0,0) 

(0,1). (0. 2) 

multiplet 



due to the spin being written on the top left-hand side. Thus for 
if = 0, S = 0 we write ^8, etc. From this table it will be observed 
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' that the values = 0, = 0 are common to the ^8, ®P, ^2) multiplets 

and Sg. Ot Mg — ±l are common to ®P, W. The table also shows 
tiiat there are actually only three distinct values of the energy which 
we have denoted by or, which is the same thing, that there are 

only three distinct roots of the determinantal equation (66), one being 
a simple root corresponding to the ^8 multiplet, another a ninefold root 
for the ®P, and the remaining one a fivefold root for the W, 


6'elecfcfon system “►Pcrturbation-^Abomic levels 


Fifteen coincident 
levels 



'S (one level) 

•’D (five coincident levels) 


.^P(ninc coincident levels) 


Fig. 23. The lower energy levels of the carbon atom. 


So far we have made no approximation beyond assuming that the 
energies of the mutual interactions of the spins and the interaction of 
the spins with the space coordinates are of a lower order of magnitude 
than the energies being considered here. In considering the determi- 
nantal equation (Fig. 21), however, we introduce certain approximations 
by rejecting some of the elements. The determinant on the left-hand side 
of the equation is made up of a chain of squares grouped together along 
the diagonal and other elements lying outside these squares. As a first 
approximation we assume that these elements Are so small that they 
may be neglected. The equation (Fig. 21 ) then splits up into two or more 
equations, one of which is I _ q 

Now since the values 8g — Oy Mg = 0 are common to the ^/S, ®P, W 
multiplets, it follows that the three-by-three matrix in (65) leads to a 
factor of the form 

(W-Wj)(W—W2)(W-w^) = 0, (67) 

and the twp-by-two matrices lead to factors 
(W’-‘W^)(W-w^) = 0 , 
and the complete determinantal equation is 

{W 0. 

It follows that two of the roots oi the equation can be found by a direct 
calculation based on the wave functions of the original six-electron 
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system. It is not necessary to calculate the wave functions of the atom 
itself, since these roots can be derived from a formula of the type (26). 

To determine the third root we make use of the fact that if we write 
the three-by-three factor of the equation 1 ^q| = 0 in the form 




“'12 

-W+u, 

u. 


22 


-W 


= 0 , 


“'31 “'32 “ 33 “ 

it must be equivalent to equation (67). Equating the coefficients of 
in these two equations we obtain the relation 


w^+w^+w^ = ^^ 11 +^ 22+^33 

for the determination of once Wi and W 2 are known. Thus it is only 
necessary to calculate the diagonal elements of Aq to determine all 
three roots Wi, w^. 

The method of Lennard- Jones and Slater sketched briefly above may 
be. used to determine the energy levels of a complex atom in all cases 
in which the multiplets differ in type. If, on the other hand, there are 
two or more multiplets of the same type, say, multiplets, the above 
method yields only the sum of the energies of these multiplets. To 
evaluate the individual energies it is necessary to find those combina- 
tions of wave functions which make the appropriate square of the 
determinant diagonal. 

In conclusion, we summarize the main steps in the determination 
of the energy levels of complex atoms by the perturbation method; 
they are: 

(i) determining the wave functions of the electrons separately in 

z 

a spherically symmetrical field J V]\ 

(ii) deducing the approximate wave functions of the many-electron 
system in determinantal form; 

(iii) calculating the diagonal matrix elements of F— £ with 

jmiX 

respect to these approximate wave functions. 

We indicated above that in many problems of physical interest elements 
of the infinite determinant lying outside the chain of squares strung 
along the leading diagonal are so small that they may be neglected. 
In the preceding analysis they were taken to be zero, but it is possible 
to obtain better approximations by taking them into account. Methods 
of doing so have been considered by Lennard-Jones.f 


t J. £. Lennard-Jones, Proc. Boy. Soc. A, 129, 598-6U( (1930). 
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30. The statistical method of Thomas and Fermi 

A semi-classical method of finding the density of electrons in a heavy 
atom has been formulated by Thomasf and Fermi. J This forms a fair 
approximation for heavy atoms only. The method is as follows: as in 
the method of Hartree, the electrons are pictured as moving in an 
electrostatic potential <[) due to the nucleus and to the charge cloud 
of the electrons themselves. Electrons are supposed, much as in the 
theory of metals, to have all energies up to a maximum — The 
maximum kinetic energy is then, at any point (x, y, 2 ), equal to e(0 — Oq), 
and hence the maximui^ momentum of any electron at the point {x, y, 2 ) is 

( 68 ) 

We now assume that a volume 8F can be chosen at any point containing 
a large number of wave-lengths (h/p) but such that 0— Oq is virtually 
constant within it. With this assumption, assuming that all electrons 
are in their lowest state and that they obey the Pauli exclusion 
principle, it follows that if n{x,y^z) 8F is the number of electrons in the 
volume element o vs 

The 2 has its origin in the fact that two electrons are to be found in 
each quantized state. 

Substituting for from equation (68) we have 


n(x,y,z) = 


(69) 


The potential function 0 is now determined by the Poisson equation 

= — 477/0, 

and p = —we, n being given by equation (69). Thus finally we obtain 
for 0 the non-linear equation 

V20 = a(O-Oo)^ (70) 

3277 %, 


where 


-(2mc)^ 


(71) 


The boundary conditions to be satisfied by the potential function O 
are readily determined. If the atom or ion is of atomic number Z, then 
for small values of r, is very nearly equal to Zejr\ thus the boundary 
condition limrO = 2;e (72) 

r-K) 

is valid. 

t L. H. Thomas, Proc, Camb. Phil. Soc. 23, 642 (1927). 

{ E. Fentai, Zeita.f. Pkynk, 48, 73 ; 49, 650 (1928). Sae also L. Brillouin, UAtome dt 
Q^homaa-Permi et la vMode du champ ^ae^f-consistent* (Paris, 1934). 
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On the other hand, if the total number of electrons in the atom is 
N, then 

j n{x,y,z)dT = N, (73) 

where the integral is taken over all space; in the case of a neutral atom 
N = Z. 

Assuming radial symmetry and introducing new variables 


I = {Ze)io^r, = 


^(O-Oo) 


a*(Zc)* ’ 

we may write equation (70) in the simplified form 


(74)' 


^ = (^>0). 

The boundary condition (72) is now replaced by 

fim^(l) = 1. 

f-H) 


(75) 

(76) 


Now the number of electrons in the volume between r = 0 and r is 


T 

z{r) — J A^hi(r) dr, 

0 

and, by Poisson’s equation, we can replace n by so that 


r i 

z(r) = ^jv‘Or^dr=zJ^idi. 

0 0 

Integrating by parts and introducing the boundary condition (76), we 
2(r) = z(l+|^-^j, (77) 

= (78) 


have 


whence 


This equation gives a simple method of determining z(r) graphically 
from the curve determined by the differential equation (75), the 
boundary condition (76), and a further boundary condition pf the type 


lim^ = — m. 


(79) 


Suppose thati4PP' is a curve determined in this way (cf. Fig. 24); A is 
the point (0, 1), P is (f, ^). If we draw the tangent to the curve at P and 
produce it to cut the ^-axis at B, then it is easily seen that the distance 
OB Is equal to and hence by equation (78) to 1— 2 (r)/Z. 

Since OA = 1 it follows that AB = z{r)IZ; we thuB have a simple 
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graphical method of determining the number of electrons within a 
sphere of radius r centred at the nucleus of the atom. 

The solution of equation (75) presents some difficulty; it is non-linear 
and involves a singularity at ^ = 0. Furthermore, the boundary 
conditions cannot be satisfied directly since one must be satisfied at 



the singular point f = 0 and the other at infinity. Since is infinite 
at f = 0 it is not possible to use a Taylor expansion in the neighbour- 
hood of the origin; instead we use an expansion of the form 

With this form for the function ^ we have 

and 

so that equating the two expressions we obtain 

a = |, 6 = 0, c — — d — \a = 

The coefficients of higher powers in the expansion have been determined 
by Baker ;f he finds that for small values of f ^ is given approximately 
by 

(80) 

Numerical solutions of equations (76) for larger values of f have been 
obtained by Baker, and also by Bush and CaldweD,t making use of 
the differe^ial analyser. It is found that there is only one solution 
of ^^uation (76) which tends uniformly to zero as f->oo, the one 
oortesponding to a value 

in equation (79). When m exceeds the critical value the curve cuts 

t E. B. Baker, Phya. Rev. 36, 630 (1930). 
i V. Bush an4 S. H. Caldwell, Phya. Rev. 38, 1808 (1931). 
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the f-axis at a point |o (cf- J’ig* 25); and when m <m^ the curve 
decreases steadily to a minimum point then increases steadily 

to infinity as | approaches a certain value a, and finally decreases to 
zero as ^ 0. Curves relating to are given in the paper by Baker. 



Asymptotic formulae for these solutions have been derived by 
Sommerfeld;t for m = he obtains for large values of | 

^ ( 1 144 

^'"(l+(^/12»)0”2l 

For large values of f this reduces to 

^ ^ 144f-3, 

a solution which may be obtained by substituting 

into equation (75) and equating powers of f on both sides of the 
equation. Similarly when m < it can be shown that 


4'- 

For neutral atoms we have 


400a 

(f-a)4 




limz(r) — N = Z; 

r-*«) 


in other words, the curve ^(f) tends to infinity in such a way that the 
tangent to it passes through the origin, since 

limOB = 1 — ilim«{f) = 0. 

^-+00 


The only curve which satisfies this condition is the curve A A* for which 

t A. Sommerfeld, Zeits.f. Phyaik, 78, 283 (1932); 80, 416 (1933). 
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= ^rrii when f = 0. The curve is asymptotic to = 0, i.e. 
O— Oq = 0, so that if <& is taken to be zero at infinity, Oq must also be 
zero, which means that the maximum energy — cOq of an electron in 
a neutral atom is zero. 



Fig. 26. Radial distribution of charge for Rb+. 


For positive ions the number of electrons N is less than the atomic 
number Z, so that 

limOJ5= 

Jn this case we choose a curve with m >• Mi such that the tangent to 
the curve at the point P(fo»^) where it cuts the axis ^ = 0 passes 
through the point J5(0, 1 — NjZ). The equation of the line BP is 

which in the old coordinate system reduces to 

and so represents the Coulomb field outside the ion. It follows at onw 
Eo = -efl>o < 0, 

shoviDg that the maximum energy of the edectrons in the ion ii 
negative} the electrons cannot therefore escape from the ion. 

As an examplef of the kind of results obtained by the Fermi-Thoma 
method, Fig. 26 shows the density of electrons in the heavy ion 
rubidium'*'. The electron density is readily calculated from equatioi 
(69) once the wave function ^ defined by equation (74) has been foun( 
by numerical integration of the non-linear differential equation (76] 

t D. R, Hartree, Proc, Camb. Phil, Soc* 24, 121 (1928). 
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The full curve shows the radial charge density calculated in this way 
and the dotted curve that calculated by the method of the self-consistent 
field (§ 28.2 above). It will be noted from this curve that the Thomas- 
Fermi method gives a good smoothed-out approximation to the distri- 
bution of charge constituting the self-consistent field. For this reason 
it is often useful to use the Thomas-Fermi method to obtain a first 
approximation to the field in the Hartree method. 

It would also appear that if the details of the field or charge distribu- 
tion are not considered to be important the Thomas-Fermi field or 
charge distribution may be sufficiently accurate. It should, however, 
be remembered that in this method the radial charge density for small 
r is proportional to while in the wave-mechanical treatment it is 
proportional to r^. 


507S 
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INTERATOMIC FORCES 


31. Introduction 

All treatments based on quantum mechanics of the forces between 
atoms and molecules make use of a theorem due originally to Born and 
Opj)enheimer.t This states that any problem in which atoms or 
molecules interact can be treated in the following way: 

First one supposes that the nuclei of the atoms concerned are at rest 
at points R^, Ra,—? Rjv, ^ being the total number of nuclei in the 
system under consideration. One then obtains solutions of the SchrO- 
dinger equation for the electrons of the system moving in the field of 
these nuclei. These solutions will be written 0^(R; q), where q are the 
coordinates, including spin, of the electrons, and BJi{R) arfe the corre- 
sponding energy values. One then treats the motion of the nuclei by 
means of a second Schr6dinger equation, in which WJi(R), together 
with the Coulomb interaction between the nuclei, plays the part of 
the potential energy function. If Y(R) is a solution of this equation, 
the wave function of the whole system is taken to be 

T(R)^„(R,g). (1) 

This approximation is used in almost any problem involving inter- 
atomic forces, such as 

{a) The vibration and rotation of a molecule. 

(6) The (gas-kinetic) collision between molecules. 

(c) Calculations of the lattice energy or elastic constants of solids. 

The use of the wave function (1) is of course an approximation only; 
the neglected term in the wave function is of order (m/lf)*, where m 
is the electronic mass and M the nuclear mass. For a proof, the reader 
is referred to the original paper. 

31.1. Rotation and vibrcUion of diatomic molecvles in their lowest ' 
electronic states 

The problem of calculating the energy levels has already been treated 
briefly in Chapter V. We start by assuming that the electronic problem 
is solved, and that the energy of the electronic S 3 rstem in its ground 
state is known for all separations B of the nuclei. Let U{B) denote 
the sum of this energy and the Coulomb interaction of the nuclei. 

t M. Bom and J. R. Oppenheimer, Ann, der Physik, 84, 467 (1927). 
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Then the SchrOdinger equation for nuclear motion is, according to the 
approximation introduced above, 

As already shown (§ 23.2), if the centre of gravity is at rest, this equation 
reduces to ^2 

^V^+{W-U(R)}^ = 0, 


where ^ is a function of R, the (vector) distance between the nuclei 
and M — Taking spherical polar coordinates, ijj takes 

the form ^ 

where / is the solution vanishing at the origin of 


(2) 

The function U{R) will depend on the electronic ground state of the 
molecule under consideration, and will not in general be expressible in 
terms of simple functions. For stable molecules the energies of the 
lower states can usually be found by writing 


U(R) = U,+iK(E^R,r, 

which is a good approximation near the minimum of the energy curve. 
This method was adopted in Chapter VI. 

Another method is to make use of the function introduced by Morse 
i® U{R) = 


The form of this function is qualitatively of the type that we expect 
for the force between atoms, except that it does not tend to infinity 
when R tends to zero; this is unlikely to be of importance. For the 
Morse function an exact solution of the SchrOdinger equation has been 
given for A; = 0 and an approximate one for Jc ^0. The energy values 
are (if the minimum of the curve is taken to be zero) 

hv(n+l)--xv^{n+l)^+Bk(k+l)+Dk\k+l)^'-(x(n+\)k(k-\-l), 


where 


a II2F\ 



B = 


siT^r 


D: 




/ 1 

lQTj^MrlD\arQ dh% 


'4 


It will be seen bow, in this formula, the term %v(n-4'i) represents the 


t P. M. Morse, Phya. Rev. 34, 67 (1929); C. L. Pekeris, ibid. 45, 9S (1934). See also 
Pauling and Wilson, Introduction to Qtiantum Mechanica, p. 272. 
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vibrational levels, Bk(k-]-l) the rotational levels, and that the other 
terms are of smaller order of magnitude for small n, k, 

31.2. Review of types of interatomic force 

We must now give an outline of t 3 rpes of force between atoms and 
ions which give rise to the formation of molecules and solid bodies. 
Among these the simplest are the purely polar forces between ions. 
Thus a very satisfactory theory of the properties of crystalline alkali- 
halides (e.g. NaCl) has been built up on the assumption that these 
materials are formed from positively charged metal ions and negatively 
charged halide ions, which attract each other according to the inverse 
square law. An account of this theory is given in Chapter VIII. The 
introduction of quantum mechanics does not affect the calculation of 
electrostatic attraction between ions; it does, however, affect our ideas 
of the extent to which the forces other than the electrostatic attraction 
affect the cohesion. 

Secondly there are the so-called van der Waals forces between two 
neutral atoms or molecules. The van der Waals force may be defined 
as the force between two atoms or molecules when they are so far 
apart that their wave functions do not overlap. In general the potential 
M energy of two molecules at a distance B apart, where B is great com- 
pared with molecular dimensions, can be expressed in the form 

-AIB\ 

where A is a constant. 

All other forces between atoms or molecules which are neither charged 
nor carry a permanent dipole arise only when they approach closely 
enough for there to be an appreciable overlap between the space covered 
by their respective wave functions. These ‘overlap* forces ar6 of three 
main kinds: 

(а) The repulsive forces which arise mainly between closed shells, 
such as rare gases and ions like Na+, C1-, Ag+ ; 

(б) The homopolar forces responsible for the bonds such as those 
between hydrogen atoms in or between carbon in diamond, 
grapMte, or the molecules of organic chemistry ; 

(c) Metallicr forces; by these are meant the forces between the atoms 
of a metal or alloy in the solid or liquid state. 

32. Van der Waals’ forces 

32.1. CakfulaMon of van der W cuds* forces by classical methods 

We have |ieen in the previous chapters that an atom may be thought 
of as consisting of a small positively charged nucleus surrounded by an 



§32 


VAN DER WAALS* FORCES 


165 


electron cloud. For atoms which have spherical symmetry the centre 
of this electron distribution coincides with the nucleus. If, however, 
the atom is placed in a uniform electric field, the electron cloud is 
distorted in such a way that its centre is displaced away from the 
nucleus causing the atom to have an electric moment. The atom is 
then said to be polarized. It can be shown in a classical way that the 
van der Waals forces between neutral atoms can be interpreted in terms 
of these polarizations. We shall discuss this method before proceeding 
to the more accurate quantum mechanical calculations. 

If the strength F of the electric field is not too great we expect the 
dipole induced in the atom to be proportional to F, Denoting the 
magnitude of the dipole by p we may then write 


p = olF, (3) 

where the constant a is called the coefficient of polarizability. The 
energy Wp of the dipole p in a constant field F is given by 

F 

Wj,= - jpdF=-lccFK (4) 

0 

If the field F is fiuctuating about the value zero the mean of the 
polarization energy Wp remains finite, 

Wp = — ( 5 ) 

since the mean of the square of the field F^ will not be zero. 

Let us now consider the field F due to n dipoles of strengths placed 
at a great distance r from the atom and inclined at angles 6^ to the 
vector r. Then if r is sufficiently great the field will be uniform, and 
have components 

n ^ n ^ 

A neutral atom in which electrons move in the field of a nucleus may 
be represented by an ensemble of fluctuating dipoles of this type; we 
may thus write 

l^n = 

where is the distance of the ?ith electron from the nucleus. For such 
an arrangement we have 




f*’ 


j3 being interpreted as a kind of niiton square of the dipole moment of 
an atom. Substituting from equation (6) into equation (5) we find that 
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the effect of this atom on another distant atom is to change the energy 
by an amount given by 

(7) 

The interaction energy of two atoms with atomic polarizabilities aj, ag 
and mean square dipole moments jSg is thus 

JY — 

it being assumed that the effects are additive. 

This classical method of calculation shows the characteristic features 
of van der Waals’ forces. The negative sign in the expression (8) for 
the interaction energy implies that the two atoms attract one another. 
The force between the two atoms varies as the inverse seventh power 
of the distance; we shall sec that this conclusion is confirmed by the 
use of the methods of wave mechanics. It is found, however, that the 
multiplicative constant oliP 2 '^°^ 2 Pi given by the classical theory is about 
five times too great if the value of j3 is deduced from measurements 
of the magnetic susceptibility.f 

32 . 2 . Polarizability of an atom in an electric field 

The considerations of the last section show that in order to obtain 
the van der Waals attraction between atoms it is necessary to calcu- 
late the dipole induced on an atom placed in a static electric field, and 
thus the polarizability of the atom. 

The calculation will first be carried through for an atom with one 
electron in a Coulomb field. The coordinates of the electron are denoted 
by Then if an electric field F is introduced along the 2 -axis 

the perturbing term introduced into the Hamiltonian, in other words 
the potential energy of the electron in this field, is 
» V = -Fez, (9) 

The SchrOdinger equation then becomes 

rji^+^I^W+~eFroo8ey = 0. ( 10 ) 

If we take , 0o = = —e^l2a 

and write ^ = ^ 0 ““^ = Wl—^®os0. jR(r)}, (11) 

then W will differ from WJ by a term of order FK Thus substituting 
from (11) into (10), making use of the equation 

VV.+^(wi+7)0« = O, a2a) 

t P. Debye, Phya. Zeita. 21, 178 (1920); 22, 302 (1921). 
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The ordinary differential equation for E{r) then becomes 


d^R Jl mR 2R ^ 
dr^ \r ajdr ““ 


which has the particular integral 




To determine the complementary function we let 




and equate powers of p— 2 to obtain the indicial equation 


(126) 


(13) 


(14) 


(p-l)(p+2) = 0. 

The root p =^-- —2 must be rejected since it would not lead to a function 
R which tends to zero with r. Putting p — 1 and equating coefficients 
of we obtain the recurrence relation 


^ n-\-\ 

a n(n+2y 

which shows that, as -> cx), 

2 

^n-i dn 

so that, for large values of r, 

R(r) ^ 

Any wave function ^ containing this complementary function violates 
the boimdary conditions; ^ is therefore given by equations (11), (14). 
Hence to the first order we have 

Making use of the form (11) for the wave function and of the equa- 
tions (12 a) and (126) we have 

Htjf = WQ{ilfQ—<l))—eFr<l>QOBd 
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as far as powers of order F^. Thus neglecting powers of order we 
have 


eF 




J 00^ 


cos 6 dr 


j 


dr 


n n ' n 


0 0 
The integrations are elementary and give finally 

w = w„-^^. 


But in terms of the polarizability a we have 

W = W,-iocF\ 

so that, since the two formulae are equivalent, 

« = laK (16) 


This corresponds to a value 0*677. 10^^^ cm.® for the polarizability of 
the normal hydrogen atom. 

An alternative treatment of the polarizability, valid for any field, 
is as follows: the considerations of Chapter III show that the perturbed 


wave function is 


♦.+<^ 21 ^.. 


where ^Q(a;, y, z) is the wave function of the ground state, y, z) of 
any excited state, and 

= (17) 

The charge density in the perturbed atom, neglecting terms of order 
H*, is thus 

p(»,!r,*) = el0or+e’‘f’ 2 conjugatej. (18) 

The s^nd term is not in general spherically symmetrical and will thus 
give rise tb a dipole moment. Thus, for instance, if corresponds to 
an d-state and to a p-state, the product will be of the form 
^/(r), and will be as illustrated in curve (3) in Fig. 27, The change in 
the charge density of the atom is also illustrated. 

The dipole moment of the atom is equal to 

jzp{x,y,z) dr, 
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which, on substituting from (18) gives for the polarizability (dipole 


divided by field) 


a = 2e* 2 


|2„. 


K- 


■w,- 


(19) 


The quantities \Z^\\ here introduced for the first time, have con- 
siderable importance for the theory. 

It is convenient to introduce the 
dimensionless quantity defined by 

The quantity/,^ is called the ‘oscillator 
strength’ of the transition of 

the atom or molecule concerned. The 
reason for this is obvious. If we write 
Vn — (W^—Wo^lh and substitute in (19), 
we find for the polarizability ol, 

oc^—yis. ( 20 ) 

n ''' 

Now a ‘classical’ electron bound to a fixed point with a restoring force 
such that it would vibrate with frequency v would give a polarizability 
e^l^TThriv^. The real atom, therefore, behaves as though it were a 
collection of oscillators of frequency and strength/^. 

In Chapter X it will be shown that the quantities determine the 
intensities of the corresponding absorption lines. 

In the next section it will be shown that 



Fig. 27. Charge density in an atom. 
(1) iinpertinbed ; (2) per- 

turbed; (3) 0*^0. 


2/n=l 

n 

for a one-electron atom, or for a series of transitions in which only one 
electron takes part. Thus if, as is frequently the case, one absorption 
line V is much stronger than all the others, the polarizability a is given 
approximately by ^2 

(X 

as on the classical theory. 

The proof that for the hydrogen atom formulae (16) and (19) are 
equivalent is left to the reader. 


32.3. The f-sum rule 

We have in this section to prove that 
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As a preliminary we show that 

J J dr. (21) 

This may be proved in an elementary way from Schrodinger’s equation. 
Thus 2m 

V¥o+^W-m = o, 

and v^*+^(W„-V)^ = 0. 

Multiplying the first equation by ^^ 2 , the second by i/jqZ, integrating 
over all space and subtracting, we obtain 

Since == 

we have, making use of Green’s theorem, the required result (21). 

It follows that 

2/« = -2 2 f 'l’*^(>dr I* >kn^dT, 

and, by the argument given on p. 95, the right-hand side is 

It follows from partial integration that this is equal to J 
thus to unity. 

32.4. Calculation of van der Waah force hosed on wave mechanics 
As already stated, the term van der Waals’ force is used in this book 
to describe the force between two atoms or molecules which are far 
' enough apart for the overlap between the wave functions to be 
neglected. 

We shall calculate the attraction between two hydrogen or one- 
electron atoms, a and 6, at a distance B from each otiber. The atoms 
liije not necessarily of the same type; we denote by ujx) tte wave 
function of ah electron in the normal and excited states in atom a, 
and by t;o(r), vjx) the wave functions for an electron in atom 6. Since 
the overlap is neglected, it is not necessary to use aiitis 3 rmmetrical 
wave functions; we may locate electron 1 in atom a and electron 2 in 
atom 6, and take for the zero-order Wave function of the system 
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The interaction energy F(ri, Tg) between the two atoms is then 

7^41 1 , 1 1 

where r^, denote the positions of the nuclei of the two atoms. 

If we use the perturbation theory of Chapter III to work out the 
change of the energy of the system of the two atoms due to F, the 
answer will obviously be zero; the integral 

J F(ri,r,)K(r,)pdri 

is the electrostatic potential of an electron fg outside the unperturbed 
atom a. This will be zero since the unperturbed atom is spherically 
symmetrical. We have to carry the perturbation theory to a higher 
order of accuracy and calculate the energy using perturbed wave 
functions. 

By equation (10) of Chapter III the perturbed wave function is 


y'o(hK(h)+ 2 

w.n' 


(w,w'|F|0,0) 




( 22 ) 


Horn W^, and )Fo, W'^ denote the energies of the normal and excited 
states of the two atoms and 


(»i,w'lF|0,0) = IJ <(ri)t!;.(r2)F(ri,r8)M„(ri)?)„(rj)dTidT2. 

This means that owing to the interaction energy either atom may be 
perturbed and carry a dipole; but the position of the electron in one 
atom depends on the position of the electron in the other. The average 
dipole moment of either atom is zero, although the average interaction 
energy of the dipoles on each is not. 

If we use (22) to calculate the mean value of the interaction energy 
we obtain |(w,w'|F|0,0)| ^ 


To calculate (n, ri' 1 F|0, 0), we note that it simply represents the electro- 
static potential energy of two electrostatic charge densities 

euXuQ and ev*v^ 

in one another’s field. If these have dipole moments, the energy is, so 
long as ^ is great compared with the atomic diameter, 
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z is here taken to be along the line joining the two nuclei, and 

2ot. = / 2o»' = J 

It is possible to choose the states n so that only one of does 

not vanish for a given state. Thus the interaction energy is 

R»Z ’ 


and by symmetry this can be written 




m2 m^2 

^On *0n' 


'Z,W,+ Wo-W„- 


■W'-‘ 


(23) 


As in the last section, if for each of the atoms considered one transi- 
tion, of frequency v, v' respectively, has an ‘oscillator strength’ large 
compared with the others, the interaction energy between the two atoms 
may be written 

-fA 


I V 1 J_ 

\47r%»; 22®* 


(24) 


This term, therefore, will give the interaction energy between two 
molecules or atoms having strong absorption lines of frequencies v, v\ 
all other lines being weak. 

The theory given here of the van der Waals attraction between 
molecules is due originally to F. London and R. Eisenschitz.f When 
the summations are carried out it is found that the van der Waals 
interaction energy between two hydrogen atoms is 

.e2/a\« 175 




. volts, 


(25) 


(i2/a)® 

where a is the atomic unit of length (0-53 A). 

The same result has been arrived at by Lennard-JonesJ by the use 
of his generalized form of the SchrCdinger perturbation theory. Denot- 
ing the two electrons by 1 and 2 and their distances from one nucleus 
by r and those from the other by the Hamiltonian of the perturbation 
term is 

(26) 




If the distance between the two nuclei, 22, is large and it is assumed 
that the electrons are never very far from their respective nuclei so 


t F. London and R. Eisentchitz, Zeits,f. Phyeih, 60, 491 (1930). See also the account 
given by H. Bethe, Handb. d, F^ytik^ 241/1, 647 (1933). 

X J. E. Lennard’Jonea, Proc, Roy, Soc, A, 129, 598 (1929). (§ 18.4 above.) 
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that fi, P 2 small in comparison with fg, and JS, then by expanding 
the expression (26) for in the form 

Hi = -^^{3co8(ri,R)cos(pj,R)-cos(ri,P2)}+o|^j, 

and applying his perturbation theory Lennard- Jones derived the 
formula (26). 

Although the calculation of the long-range forces between hydrogen 
atoms is of great theoretical interest it is of little practical importance. 
The practically important case of helium atoms has been considered 
by Baber and Hass^f by a method involving various refinements of 
the London-Eisenschitz method. Calculations on the van der Waals 
interaction between heavier atoms have been performed J and are found 
to lead to results in reasonably close agreement with the experimental 
observations. 

The calculation of the van der Waals force between two hydrogen 
molecules has been carried out by Massey and Buckingham§ using a 
variational method. Particular interest is attached to this calculation 
because of the fact that the hydrogen molecule being axially sym- 
metrical possesses a permanent quadripole electric moment. Part of 
the force between two hydrogen molecules is therefore made up of the 
static interaction between these quadripole moments. It is found that 
the interaction energy in this case is 

where the angles a, jS, 6 determine the fixed orientations of the two 
symmetric diatomic molecules. Of the two terms the second is the 
dynamical dipole-dipole interaction of the type which gives rise to the 
van der Waals force between two hydrogen atoms in 5-states, and 
the first the quadripole interaction referred to above. It is found that 
whereas the average of F over all orientations is finite, that of 0 
vanishes, so that the static force is not of predominant importance in 
gas-kinetic phenomena. 

The interaction of the dipole system of one atom with the quadripole 
system of another has been considered by various authors.|| It is found 

t T. D. H. Baber and H. R. Hass6, Proc. Comb. Phil, Soc, 33, 263 (1937). 

i J. G. Kirkwood, Phya. Zeita. 33, 67 (1932); H. HeUmann, Acta Phyaicochim, 
VS8R. 2, 273 (1936) ; R. A. Buckingham, Proc. Boy. Soc, A, 160, 94 (1937). 

§ H. S. W. Maseey and R. A. Buckmgham, Proc. Roy, Irish Acad. 45, 31 (1938). 

H H. Margenan, Phya. Rev. 38, 747 j(1931); R. A. Buckingham, loc. cit. ; J. E. Mayer, 
J. Chem. Phya. 1, 270 (1933). 
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that this interaction gives rise to a term proportional to which is 
appreciable for moderate distances; for instance, Mayer calculates that 
the dipole-quadripole interaction amounts to 20 per cent, of the dipole- 
dipole interaction in the energy of certain crystals (NaCl, KCl, KI). 


32.5. The calculation of the van der Waals constants from observed 
polarizabilities 

In the investigations of Kirkwood, Hellmann, and Buckingham, cited 
above, on the van der Waals interaction between heavy atoms, a great 
deal of attention was devoted to attempts to relate the constant c of 
the van der Waals energy (— c/r®) to the coefficient of polarizability. 
The previous calculations were improved by Buckingham, f who made 
use of wave functions calculated by the method of self-consistent fields, 
and included the effect of electron exchange. 

For an atom with N electrons in a single shell, Buckingham found 
that the polarizability of the atom can be written in the form 




(28) 


where a is the radius of the first Bohr orbit and is a certain mean 
of the square of the radius of the outer orbits measured in atomic units. 
Introducing a mean radius of the outer orbits in centimetres by means 
of the equation „ 

we can write equation (28) in the form 

« = ( 29 ) 

9 a ^ ' 


The constant, c, of the van der Waals field between two unequal atoms 
(Ni and N^) is found by Buckingham to be related to the a, thus defined, 

«( o )of+ai' 
or, in terms of polarizabilities 

c = Me^ 


(31) 


The formula (31) is readily generalized in the case in which there are 
several shells each wit^ a characteristic size and polarizability ^f the 


t R. A. Buckingham, loc. oit. 
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(^D ^i) sub-group of electrons in the atom 1 by a(7ii, l^) and the number 
of atoms in it by then formula (31) is replaced by 


! = faM 2 ■ 




(32) 


f li)lv(niy 1!i)}+V{“(^2) h)}' 

where the summation extends over all pairs of sub-groups. 

By means of this formula the van der Waals force constants can be 
calculated from observed polarizabilities. A series of values of c was 
calculated by Buckingham and is shown in Table 4 below. 


Table 4 


Interaction 

c 

(IQ- ^^ergacm^) 

Interaction 

c 

(10~®®crgfscm.®) 

Interaction, 

c 

(10~®®ergrscm.®) 

He— Ho 

1*63 

A— A 

63*5 

Na+— Na-^ 

3-70 

He— Ne 

3-48 

A— Kr 

92*7 

Na+— K-^ 

10-4 

He— A 

9-89 

A— Xe 

135-6 

Na+— Rb+ 

15-1 

He--Kr 

144 



Na*-— Cs*- 

21-6 

■He— Xe 

20-7 

Xo— Xe 

293 







K'— K+ 

33-3 

Ne— Ne 

748 

I.i^— Li' 

0-097 

K+~Kb+ 

48-4 

Nt>— A 

205 

Li+— Na' 

0-584 

K^— Ch+ 

71-3 

Ne— Kr 

30-0 

Li+— K' 

1-51 



Ne— Xe 

42-6 

Li+— Rb+ 

2-21 

Rb+— Rb+ 

70-5 



Li+— Os'- 

3-14 

Rb+— Cs+ 

104 

Kr— Kr 

136 





Kr— Xe 

199 

•• 


C8+— Cs+ 

155 


33. Overlap forces 

In the last sections we have considered the van der Waals forces 
between atoms, which arise when the overlap between the wave func- 
tions is negligible. All other forces that occur between neutral atoms 
may be classed ‘overlap forces’ and arise only when the wave function 
of one atom overlaps that of the other. Among overlap forces can be 
included: 

(а) The valence forces of chemistry. 

(б) The forces of repulsion which always set in when atoms or ions 
with a closed shell configuration approach closely. 

Both types of force may be calculated by the same methods; two 
methods, in particular, are available. These are: 

(i) The method of London-Heitler. 

(ii) The method of molecular orbitals. 

Neither method will, of course, give an exact answer, but both can be 
treated as approximate methods from which a start can be made to 
build up more accurate wave functions. 
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In either method, the Hamiltonian E of the elections in the molecule 
must first be written down; an approximate wave-function T must then 
be found, and the energy of the molecule tjalculated from the formula 

= (33) 

The methods differ in the form taken by the approximate wave 
functions. 

The difference can be seen most simply in the case of a diatomic 
molecule, and the hydrogen molecule Hg will be considered first. In 
the method of Heitler and London wave functions are set up for an 
electron in one atom in the absence of the other atom. Thus in hydrogen 
let denote the wave function for an electron in one of the atoms 
of the molecule in the absence of the other, and ^^(r) the wave function 
of an electron in the other. In the original paper by Heitler and 
Londont on the hydrogen molecule Hg, and 0^ were taken to be 
wave functions for electrons in the ground states; thus if the two nuclei 
are located at points 72^, and denotes the vector r—Ra and 
the vector r— then 

^a(r) = ^^(r) = 

where c is the normalizing constant. In other cases, in particular in 
considering the carbon bonds, the wave functions will not necessarily 
refer to the normal state, since a lower value of W, and hence a better 
approximation, can be obtained otherwise. 

We then set up our approximate wave-function Y ; it must have the 
required antisymmetry in the coordinates of the two electrons, and will 
thus have one of the two forms (according to the arguments of Chap. V) 

Y = 4[^o(ri)^j(r2)+^„(rij)^6(ri)]x4(«i,«j), (34a) 

T = 5[0<.(ri)^»(r,)-^„(r3)^»(r,)]x.(ai.s,). (346) 

Here x^(^i>^ 2 ) is an antisymmetrical wave function in the coordinates 
of the two spins, giving therefore a singlet for each set of orbital wave- 
functions iftf, when the energy W is calculated from (33); 
is symmetrical and gives a triplet series of terms. J5 are normalizing 

factors. 

If the energy W of the molecule is now calculated according to (33) 
using the wave functions (34 a), (346) respectively, ‘we find 

W = A^^+J} in case (34 a) 

= J} in case (345), 

W. Heitler and F. London, Zeite.f. Physik, 44, 466 (1927). 
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where ® 

and ^o(ri¥(.(rsWa(rii¥6(ri) dri^Ti. 

The integral J is called the ‘exchange integrar; for hydrogen it is 
negative, and varies with distance more rapidly than Q. It thus leads 
to attraction and to the formation of a stable molecule for the singlet 
state (34 a) but not for the triplet states (346). 

In the method of molecular orbitals,! one starts with wave functions 
for the individual electron in which each electron is supposed to be 
shared equally between the two atoms. Wave functions can thus be 

’'P Ur) = Ur)+Ur) (36) 

or = tlijr)—>l>b(r)> (36) 

where and ij/f, are as defined above. Alternatively wave functions 
may be obtained which are a better approximation to the problem of 
an electron moving in some sort of self-consistent field for the whole 
molecule. One then builds up the molecule, putting not more than two 
electrons into each state corresponding to the wave-functions 

etc. 

The method of molecular orbitals is in general very suitable for 
handling the excited states of molecules and for the discussion of com- 
plex molecules; for the discussion of metals it is at present the only 
practicable method, the London-Heitler approximation proving too 
difficult for the mathematical methods now available (cf. Chap. VIII). 
It is probably not so suitable as the starting-point of detailed numerical 
work. 

The reason for this may be seen by setting up the wave functions 
for the lowest singlet and triplet states in hydrogen; these are, using 
the method of orbitals, 

^a(ri)0o<(r2)x^(«i>«s) (^'^) 

and [0<,(ri)i/-|5{rj)-^„(r8)0j5(ri)]xs(«i,«j). (38) 

The product >l>jTi)<l>Jrt) may be written 

This differs from the wave function (34 a) in including the terms 
^a(**i)^a(r 2 )> which represent both electrons in the same 

atom. The method of molecular orbitals thus allows electrons with 

t The name is due to J. E. Lennard-Jones {Trans, Faraday 8oc. 25, 668 (1929)) ; he 
and F. Hund {Zeits.f, Physik, 51, 769 (1928), ibid. 63, 719 (1930)) first introduced it. 
It has been extensively developed by R. A. Milliken {Rev, Mod. Phys. 4 , J9 (1932)) and 
by Coulson (refs. pp. 189, 192). 
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opposite spins to come into the same atom, and when the energy is 
calculated a large term will arise from the interaction between them. 
This term is certainly greatly over-estimated in the orbital method; the 
best approximation to the wave function would be of the type 

*'{lAa(*‘i)^a(r2)+lA6(ri)>^6(r2)}+iAa(ri)>/-6(rij)+i^„(r2)^ft(ri), (38a) 

with k between zero and unity. 

On the other hand, the wave function (38) for the triplet states 
reduces exactly to the same form as in the London-Heitler approxima- 
tion (346). In the orbital model, then, the symmetry of the wave 
function prevents two electrons with the same spin from coming into 
the same atom. 


33.1. Detailed discussion of the hydrogen molecule by the London- 
Heitler method^ 

In this section a discussion is given of the hydrogen molecule in its 
lowest electronic state treated according to the approximation of 
Heitler and London. The discussion of the last paragraph shows that 
in its lowest state the spins of the two electrons have a symmetrical 
wave function and must therefore be antiparallel, that the ground state 
is therefore a singlet (spin moment zero), and that the energy of the 
molecule is given by formula (33). In this section we have, therefore, 
only to set down the Hamiltonian and work out the normalizing factors 
B and the integrals Q, J. 

The normalizing factor will be discussed first. The wave functions 
0„(r), \jsffr) will be taken as normalized to unity. Then it is easily seen 

that ^2= l/(2+2A2), J52 = l/(2-2A2), 


where A = J^„(r)^6(r)rfT. (39) 

We have thus to calculate Q, J, and A. 

The Hamiltonian for the system is 

^2 ^2 ^2 p 2 ^2 pZ 

(40) 

Bn r^2 hz Hz ^ab 

Here denotes the distance between the. nucleus a and the electron 
with coordinate r-^\ distance between thd two electrons, and 

the distance between the nuclei. The term in ensures that 
W will give the total energy of the molecule, including the repulsion 
between the nuclei. . 


t For further details see, for example, Handbuch der Physik, vol. 24/1, 636 ^1933); 
Pauling and Wilson, Introduction to Quantum Mechanics, p. 340 (Cambridge, Mass.,. 
1936). 
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The simplest form to taT^e for the wave-functions ijj^(r), is that 
of the unperturbed hydrogen atom; this was the course adopted in the 
original paper of London and Heitler. With 
these wave functions the integral can be 
evaluated to give for the energies of 
the symmetrical and antisymmetrical states 


W — OW I I ^1+^2 


W — OW I I ^ 


12 


(41) 

(42) 



Fig. 28. Coordinatoa used for 
the hydrogen molecule. 


where Wjj is the energy (— me^/ 2 ^ 2 ) Qf 

ground state of the hydrogen atom, A is defined by equation (39) and 
hence is 




e-<ra+nOla dr,. 


(43) 


and and denote the integrals 

Wn = f f 

J J Vw W 

Wa hil 

Wn = ¥^jj ^(r„i)^(»-aa)^(rM)'AKa){5^-;;J 


1 

1 

1 

--1 


~ra2‘ 

hi 


1 

1 

--1 

drj dr 


hi 

h2) 


(44) 
dti dr^ 


The problem is thus reduced to the evaluation of the integrals A, 

Wia. 

To evaluate the integral A we introduce confocal elliptic coordinates 
f , ij defined by the relations 




^a+h 


Rob’ Rob 

and the azimuthal angle The variables f , rj so defined lie in the ranges 

l<f<oo, — 1 ^ 7 j<Cl and the volume element becomes 

dT^lRl,(P-'q^)d^d7^d<l>. (47) 




(46) 


Substituting from equations (46) and (47) into (43) we obtain 
00 1 

• A = § J df j (f*-y*)dy = (l (48) 

1 "1 
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In the expression (44) for the integral the first term may be evaluated 
by the use of the elliptic coordinates | and tj, the second and third 
directly. The final result is 

te+8-4V-6^)- 

The integrals involved in the evaluation of W 12 are cumbersome; indeed, 
that involving l/rjg presents some difficulty. Heitler and London 
developed an approximation for its value was later derived by 
Sugiura,t who found the expression 


W - , 103 6& 11 


+ 


+log + A'^Ei | ^ j - 2AA'Ei| - , (50) 


' 5E, 

where A is given by equation (48), A' by the equation 




a 


ab 

3a2/ 




y is Euler’s constant, and Ei denotes the integral logarithm 

z 



Ei(Z) : 


j? 


dt (Z<0). 


Substituting from equations (48), (49), 
and (50) into equations (41) and (42) we 
obtain expressions for the energies Wg 
and as functions of J?^. The varia- 
tion of these functions with is 
shown in Fig. 29; the function de- 
creases monotonically while Wg has a 
minimum at = 1*518. Wj^ corre- 
sponds then to repulsion at all distances 
so that a stable molecule is not formed. 
On the other hand, corresponds to 
the attraction of the two hydrogen atoms with the formation of a stable 
molecule. .The equilibrium value of the relative position of the nuclei 
18 given by ^ ^ ^ 

which is in excess of the known experimental value 0*7395 A. The 
corresponding theoretical value of the heat of dissociation is 3*14 eV 


Fig. 29. Energies of the symmet- 
rical and antisymmetrical states of 
Hg as a function of the distance 
between the atoms. 


t Y. Sugiura, ZeUa.f. Phyaik^ 45, 484 (1927). 
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in contrast to the observed value 4*48 eV. The vibrational frequency 
of the nuclei, determined from the curvature of the function Wg in the 
neighbourhood of its minimum, comes out as 4,800 cm.”^; the value 
obtained from observations of band spectra is 4,405*3 cm.-^ 


33.2. Improvements in the approximation of London and Heitler 
A simple improvement is to use for ^^(r), 0^(r) not the wave functions 
for an electron in an unperturbed hydrogen atom but the wave functions 
for an electron moving in the field of a charge 2 , which will then be 
chosen to minimize the energy for each value of This was first 
done by Wang,*|‘ who found a value of 2 , at the interatomic spacing 
giving minimum energy, given by 2 = 1*166. 

The addition to the London-Heitler wave functions of polar wave 
functions corresponding to electron configurations H+H- and H”H+ has 
also been considered by Weinbaum. J Weinbaum takes a wave function 
of the type (38 a) discussed above, with 0(r) as before the Is wave 
function for an electron moving in a charge ze. He finds 2 = 1*193 and 
k — 0*256 for the values of the parameters giving minimum energy. 
The fact that k is less than J shows that the London-Heitler is a better 
approximation than that of molecular orbitals, which gives k = 1, for 
the hydrogen molecule. 

Other improvements in the wave functions are: 

(a) That of Rosen,§ who takes for ^(r) a mixture of s and p functions 
of tt. type 


He finds (T = 0*10. 

(b) Further work by Weinbaum (loc. cit.), who combines this refine- 
ment with that of polar states. He finds 

2 ' =1*19, (7 = 0*07, fc = 0*176. 


Table 5 



Heat of 

diaaociation (eV.) 

(A) 

Vibrational 

Jreqmncy{om.-^) 

Heitler-London 

314 

0-80 

4800 

Wang. 

3-76 

0-76 

4900 

Weinbaum (ionic terms) 

4*00 

0-77 

4760 

Rosen (addition of p terms) 

4-02 

0-77 

4260 

Weinbaum (ionic terms and p terms) 

4-10 



James and Coolidge .... 

4-722 

0-74 


Experiment 

4-477 

0-7396 

4406-3 


t S. C. Wang, Phys. Rev, 31, 679 (1928). 

X S. Weinbaum, J. Chem. Phya. 1, 693 (1933). 
§ N. Rosen, Phya. Rev. 38, 2099 (1931). 
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(c) A much more complicated wave function set up by James and 
Coolidge,t in which terms in fjg are included. 

The results of these calculations are given in Table 5.$ 

For further details the reader is referred to the original papers, or 
the accounts by Pauling and Wilson (loc. cit.). 

33.3. S(me other applications of the London-Heitler method 
Repulsive forces between atoms and ions with the closed shell configura- 
tion. Only for helium and neon have detailed calculations of the inter- 
action energy between two atoms with the closed shell configuration 
been made. The overlap energy is found to be repulsive at all distances. 
The reason for this may be seen qualitatively as follows: each electron 
in one helium atom will interact with two electrons in the other, one 
of which has spin parallel to its own and one which has spin antiparallel. 
As regards the interaction between the electrons with parallel spins, 
we know that the spin wave function must be symmetrical and the 
orbital function therefore antisymmetrical. The contribution to the 
energy of the molecule is thus of the form 

Q+J^ 

As regards the interaction between electrons with antiparallel spins, the 
spin wave function is as likely to be symmetrical as antisymmetrical, 
and the contribution to the energy of the molecule is thus of the form 

i(Q+J)+i(Q-J)-Q- 

The total energy of the pair of interacting atoms is thus 

2Q-\- J. 

The exchange integral J being negative, repulsion between the mole- 
cules results at all distances. 

Detailed computations of the interaction between two helium atoms 
along these lines have been made by Grentile§ and Slater. |i Slater 
obtains a complicated expression for the interaction energy that can 
be approximated closely by 

*2 

. 7*7 X an = = 0-529 A. 

' ' me^ 

This expression, added to a term giving van der Waals’ attraction, has 

■f H. M. James and A. S. Coolidgo, J. Chem. Phys. 1, 826 (1933). 
j From Pauling and Wilson, The Structure of Band Spectra. 

§ G. Gentile, Zeite.f. Phyaik, 63, 796 (1930). 

II J. C. Slater, Phys. Rev. 32, 349 (1928). 
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been extensively used for the calculation of the equation of state of 
helium gas.f An expression for the complete interaction energy given 
by Slater and Kirkwood is 

7.7e-2-43B/<i,_ 10-10 ergs. 


{H/aor 

This curve gives a shallow minimum of depth 
10"^® ergs (6 x 10“^ eV) at 2-9 A. It is 
not certain whether the ‘potential hole’ is 
deep and wide enough to permit the forma- 
tion of a stable molecule. 

Computations have also been carried out 
for neonf using the same method. For the 
repulsive potential is found 

MSxlO^ e-^^/o-395 eV, 
where R is the distance between the atoms in atomic units. 



33.4. The hydrogen molecule by the method of molecular orbitals 

The elements of the method of molecular orbitals have been discussed 
above; the wave functions of the individual electrons are taken to be 
the solutions of some SchrOdinger equation 

0 

where V is some field which is the same for all the electrons. The 
essential points about the approximation are thus: 

(a) that each electron is thought of as being shared between all the 
atoms of the molecule, its position being uncorrelated with the 
positions of the others (at any rate those of opposite spin) ; 

(b) the wave functions for the individual electrons are orthogonal 
to each other. 

To illustrate the method we shall consider the ground state of the 
hydrogen molecule.§ Suppose we denote by (/>(1) the molecular orbit 
for electron 1 in the field of the two nuclei A and B, then in the 
molecular orbital treatment we write for the wave function of the 
hydrogen molecule ^ ^ (61) 

t Cf. W. H. Keesom, Helium (Amsterdam, 1942); J. de Boer and A. Michels, Phyaica^ 
6, 409 (1939); H. S. W. Massey and R. A. Buckingham, Proc. Roy. Soc. A 168, 378 
(1938); E. Beth and G. E. Uhlenbeck, PhyaicAi, 4, 915 (1937); J. C. Slater and J, G. 
Kirkwood, Phya. Rev. 37, 682 (1931). 

t W. E. Bleick and J. Mayer, J. Chem. Phya. 2, 262 (1934). 

§* C. A. Coulson, Trana. Faraday Soc. 33, 1479 (1937). 
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where xa(^i> ^ 2 ) denotes the spin wave function which is usually omitted 
since it makes no further contribution either to the wave function or 
to the energy. The figures 1 and 2 stand for and Tg. 

The molecular orbital method therefore depends on the solution of 
the wave equation for a single electron moving in the field of two fixed 
protons. The solution of the problem of the hydrogen molecular ion 
(H^) is thus fundamental to the discussion of molecular structure by 
the method of molecular orbitals and must be investigated first. 

The Hamiltonian of the system may be written 


2m 

To solve the SchrOdinger equation 

= W<t, (62) 

we transform to the elliptic coordinates rj, by equations (46) with 
azimuthal angle x- these coordinates 

1 1 ^ ^ 

Introducing the radius of the lowest orbit of the Bohr atom 
(a^ = ft^/me^), transforming the Laplacian to these coordinates, and 
writing 2 

^ab 

we find that equation (62) reduces to 


+ = 0 . 

This equation is separable; writing 

= x(mmx)^ 

we find that the functions X, F, and ® satisfy the ordinary differential 
equations ^ 2 ^ 


dx 


= 0 
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where A, are constant introduced when the equations are separated 
and the functions /, g are defined by 


M) = ^ W+^)+/^. 


In the lowest possible electronic state A = 0, so that writing 


we have 



Xdr 

^ YdT,’ 

(63) 

II 

+ |2-1 ’ 

9{v)+^G 

dr] 

(64) 


The equations (54) have been integrated numerically by Burrau.f 
The range of the variables is 1 < ^ < oo, — 1 < < +1; by assigning 

a value to the ratio M^la^ the solution of the equations can be 
expanded in a power series for 0 < J and in another series for 
^ ^ 7 j The two series have the same sum at J only if w has 
a certain value. In this way w (and hence W) is determined for each 
value of the internuclear distance ; the results obtained by Burrau 
are shown in the table below. The accuracy of Burrau ’s calculations 


^abl^o 

0 

1*0 

1-3 

1-6 

1*8 

‘2-0 

2-2 

2-4 

2-96 

00 

aoF/e® 

+ 00 

-■0-896 

-MIO 

-M86 

-M98 

-1-204 

-1-200 

-1-192 

-1-158 

-1-000 


has been improved upon by HylleraasJ who assumed a solution of the 


lurm 00 

Y(v) = i 

nHAl 

and also by Steensholt.§ These treatments correct Burrau’s values by 
very small amounts; all three give = 2*OOao for the equilibrium 
value. This corresponds to a value R^^ == 1*06 A in agreement with 
Richardson’s spectroscopic estimate|| of the distance between the pro- 
tons in the ion. 

A method such as that of Hylleraas is necessary to obtain accurate 
forms for the wave function ^ for the problem; it is found, however, 
that by taking ^ to be a linear combination of atomic orbitals it is 
possible to construct a wave function which is sufficiently accurate for 
molecular calculations. If denote the wave functions of the 

electron treated in atoms A and B, supposed normalized to unity, then 

one may take <l> = AUra)+BUh)- 

f 0. Burrau, Kgl. Danske Vid. Selskab, 7, 14 (1927), 

* t E. A. Hylleraas, Zeita.f. Physik, 71, 739 (1931). 

§ G. Steensholt, ibid. 100, 647 (1936) ; Norske Vid.-Akad. Avh. No. 4, 1936. 

II 0. W. 'Richardson, Trans. Faraday Soc. 25, 686 (1929). 
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The constants A and £ are found by minimizing the energy 

W = f drlj ,l,*4 &T\ (55) 

in this way we find A = ±B = (2±2^)-^, 
giving for the wave functions 




(56) 

and 

! 

II 

(57) 


The form of these two wave functions is shown in Fig. 31. 



Fig. 31. 


For the wave function wo shall assume the form 

•I’K) = ( 58 ) 

differing from the wave function of the ground state of the hydrogen 
atom by the inclusion of a ‘screening constant’, c, which will be a 
function of the nuclear separation Substituting from (68) into 
equation ^66) and inserting in equation (66), we obtain for the energy IT, 


W^^Ifc^-Oc+^\ 

-^dbl 


(69) 


where F(p), 0{p) are known functions of the variable p = the 

expressions for F, 0 are rather cumbersome but may be evaluated 
in the same way as the integrals in § 33.1. For a given nuclear separation 
R^ the best value of c is given by the equation 

m =0. 


1 / 


i?a> 


the suffix R^ denoting that is assumed constant during the 
differentiation. It follows immediately from (69) that c is a root of the 


equation 






m 

dp 


( 60 ) 
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The values of c determined from this equation are shown in Fig. 32, 
and the corresponding curves for If as a function of are given in 




Fig. 33. Also shown on the same diagram are Pauling’s curves obtained 
by taking c = 1 in equation (58) and inserting this wave function in 
(65) with the combinations (56) and (57), and the curves obtained by 
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Hylleraias and Steensholt from the accurate forms of in spheroidal 
coordinates. 

It appears that the simple forms (56) and (57) approximate quite 
well to the exact wave functions for the molecular hydrogen ion derived 
from the solution of the SchrOdinger equation in confocal elliptic 
coordinates. 

Let us now consider the hydrogen molecule for which the Hamiltonian 
is (40), (cf. Fig. 28). In the molecular orbital method the wave- 
function ^ is of the form (51). Dropping the spin factor and substituting 
for ^(1) and ^(2) from equation (56) we have 

where denotes a normalization factor. 

Using the wave function (61) with 0(rai) defined by (58) we find that 
the energy can be put in the form 

W ^-l2Fc^-2Gc+Jc+^], (62) 

®o\ ^abl 

where F and 0 are the same functions of /> as in the case of the molecular 
hydrogen atom, and J is another function of the same type arising from 
the presence of the term l/r^g in the Hamiltonian (40). Applying the 
variation method as before we find that c is a root of the equation 

Calculating the explicit forms of F, G, J we can draw a curve of the 
type Fig. 34 which gives c as a function of Babl^O’ The values for the 
ion H^ are also shown on the same scale. It will be seen from this 
diagram that the repulsion between the two electrons in the neutral 
molecule tends to diminish the value of c. 

The corresponding curve for the energy (62) is shown in Fig. 35 
together with the curve obtained by Coulson using the Heitler-London 
method with the variation principle. The constants of the hydrogen 
molecule obtained by these methods of approximation are shown below. 


1 

1 

^abl^o 

Binding 

energy 

{volte) 

a 

Molecular orbital . 

iSl 

3-470 

1-197 

Heitler-London 


8-761 

M66 

Observed 

mm 

4-477 

•• 
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It would seem that in the simple case of the hydrogen molecule the 
molecular-orbital and the electron-pair treatments are almost equally 
good. It is found, however, that in more complicated polar molecules 



Fig. .S4. r as a function of for H 2 and H.j . 



Fig. 36. Hj molecular energy curves. 

the molecular-orbital method is superior to the electron-pair (or Heitler- 
London) method? The method of molecular orbitals has been applied 
to a few molecules.f 

t See, for example, C. A. Coulson, Trans. Faraday 80 c. 33, 388 (1937) (CH 4 ); Proc. 
Camh. Phil. Soc. 31, 244 (1935) (HeJ*); Proc. Boy. Soc. 169, 413 (1939) (polyenes and 
aromatic molecules) ; J. K. Knipp, J. Chem. Physics, 4, 300 (1936) (LiH +) ; H. M. James, 
J. Chem. Physics, 3, 9 (1936) (Lia+); C. A. Coulson and G. S. Rushbrooke, Proc, 
Camb. Phil. 80 c. 36, 193 (1940) (unsaturated hydrocarbons); J. E. Lennard-Jones and. 
C. A. Coulson, Trans. Faraday Sw. 35, 811 (1939) (polynuclear hydrocarbon molecules). 
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34. Distribution of momentum in the hydrogen molecular ion 

A great deal of theoretical work has recently been done on momen- 
tum distributions in molecules.f In addition to its intrinsic interest 
the determination of the momentum distribution of the electrons in 
a molecule is of importance since it determines 

(i) the shape of the Compton line when X-rays are scattered from 
the molecule ;t 

(ii) the distribution of the fast inelastically scattered electrons from 
the molecule.§ 

Since these quantities can now be determined experimentally with a 
fair degree of accuracy it is important to attempt some kind of theo- 
jcetical treatment of the distribution of momentum in molecular systems. 
Although Coulson and Duncanson have discussed the momentum 
distribution in several molecules we shall confine our attention to that 
in the molecular hydrogen ion 

According to transformation theory, the momentum wave function 
x(p) of the molecular hydrogen ion is obtained from its space wave 
function <^(r) by means of the Fourier integral 

x(p) = pJe-M^(r)iT, (63) 

where for the symmetrical wave function we may assume ^ is given 
by equation (56) with the ^^s given by equation (58). If we use atomic 
units (Chap. VI above) and denote the position vectors of the nuclei 
by r„, Tft, then (63) assumes the form 


X(P)= -^/oToaV- 


^(2+2A) 


^vhere A(jp) = J ■■ 


2»c‘ 


^(l)2+c2)2 


(64) 

( 66 ) 


The density function is then 

x(P)x*(P) = (66) 

Now the function A(p) is simply the momentum distribution of the 
atomic orbital (58), so that the density (66) is the same as it would be 

t Hiciw, Phys. Rw, 52, 436 (1937); C. A. Coulson, Proc, Qamb. Phil. Soc. 37, 66, 
74 (1941) ; W. E. Dunoanson, ibid. 37, 397 (1941) ; C. A. Coulson and W. E. Duncanson, 
ibid. 37, 67, 406 (1940), ibid. 38, 100 (1942). 
t J. W. M. Du Mond and H. A. Kirkpatrick, Phys. Rev. 52, 419 (1937). 

§ A. L. Hughes and M. M. Mann, ibid. 53, 60 (1938). 
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if the electron were confined to one of the nuclei, except for the factor 

l + C08(p.RJ 

1+A 

which gives rise to a kind of diffraction effect and is consequently called 
the diffraction factor. 

The density function x(P)x*(P) given by equation (66) has symmetry 
around the intemuclear axis, so it is desirable to draw the contours 
of equal momentum density in a plane through this axis. The result 
is shown in Fig. 36, from which it will be observed that the contours 


Scale 

0*2 unit of 
momentum 



Dir ection of 
intemuclear aiis 


Fio. 36. (/ontours of constant momentum density for H,/. 
(From Coulson, Proc. Camb. Phil. »Soc. 37, 62 (1941).) 


are approximately ellipses with major axes perpendicular to the 
direction of the intemuclear axis; there is thus greater probability in 
directions perpendicular to the bond than along it. 

The probability Ix(Pz)^Px component of the momentum 

along the intemuclear axis has a value between pj. and Pj.~^dpx 
determined by the equation 


— 00 —00 


In the case of a hydrogen atom =0, c = 1,A = l,so that 


^xiVx) 


8 

37 ^ 1 +^)'* 


( 68 ) 


The variation of the functions (67) and (68) with p^. is shown graphically 
in Fig. 37, from which it is evident that in the molecular ion there are 
fewer large values of pj^ than in the atom ; if we calculated the probability 
functions Iy(Py), Iz(Pz)y perpendicular to the intemuclear axis we should 
obtain precisely the opposite effect. 
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For the application of these principles to more complex molecules 
the reader is referred to the papers by Coulson and Duncanson cited 
above. 



Fig. 37. Distribution of component of momentum 
along the intemuclear axis. 

35. Qualitative results for more complex moleculesf 
35.1. The principle of maximum overlapping 
The mathematical difficulties encountered in the calculation of the 
energies of molecular orbits, other than those of Hg and are so 
formidable that problems of molecular structure are usually discussed 
with the aid of a new assumption known as the principle of maximum 
overlapping. It is assumed as a first approximation that the exchange 
energy is due to the overlapping of the electron clouds surrounding the 
nuclei and that the energy of a molecular orbit is lowest (i.e. the binding 
energy is greatest) when the atomic orbits which it resembles overlap 
as much as possible. 

In applying this principle we require a knowledge of the probability 
distribution functions for atomic orbitals. We saw in the case of the 
hydrogen atom that a Is wave function is of the form 

m 

so that the angular dependence of the wave function for an atomic 
orbital is of the type shown in Fig. 38 a. The angular dependence of 
the wave functiqn 

for a 2j)^-atomic orbital is shown in Fig. 386. The squares of these 
functions give the angular dependence of the probability distribution 

t See L. Pauling, The Nature of the Chemical Bond, chap, iii (Cornell Univ.); C. A. 
Coulson, Proc. Roy, Soc. Edin. A, 61 , 116-39 (1941-2). 
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functions. The 2py, orbitals are of the same form as the 2p^ one 
except that the axis of symmetry is Oy, Oz respectively in place of Ox. 
In the 5-orbit the electron is practically confined within a sphere 
whereas an electron in a 2p-orbit is most likely to be found around 
or within a region with the dumb-bell shape of Fig. 38. 



Fig. 38. Polar representation of absolute 
values of the anguhir wave function for 
k-orbital and 2p-orbitals. 


The use of the principle of maximum overlapping is most simply 
illustrated with reference to the water molecule H 2 O. Each hydrogen 
atom is in a l 5 -state. The oxygen atom has the electronic structure 

(l5)2(25)2(2p)S 

the two inner -electrons keep close to the nucleus and do not affect 
the binding, while those filling the 28 atomic orbital form a closed sub- 
group round the nucleus. Of the four remaining electrons two fill the 
2p^-orbital and do not take part in the formation of molecules (since 
they are paired together); finally, there is one electron in the 2p^ orbit 
and one in the 2py orbit. They alone take part in molecule building. 
We assume further that the 28 and 2p wave functions do not mix. 
If this is so, then by the principle of maximum overlapping the greatest 
binding energy is obtained by placing one hydrogen atom along the 
ir-axis and the other along the y-axis, as shown on the left in Fig. 39, 
which is taken from Coulson’s paper.f We see immediately that the 
angle between the bonds, HOH, should be about 90 ; this derivation, 
however, ignores the mixing of the 2p and the 28 wave functions and 
the repulsion between the two hydrogen atoms. These corrections have 

f C. A. Coulflon, loc. cit., Fig. 6. 
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the effect of incre^g the bond angle. The observed bond fl.n gl«s for 
water-t 3 rpe molecules (oxygen family) are:f 

HjO 


F*0 

C1*0 

HjS 



106“ 

100 “ 

116“ 

92“ 



Fig. 39. The water molecule. 


35.2. Carbon molecules: hybridization of wave functions 
In the carbon atom two of the six electrons are in iT-shell orbits and 
take no part in chemical binding. The other four electrons are i^-shell 
electrons whose orbits are chosen from 25 , 2py, 2p^, The nature 
of the bonds that carbon forms in any particular molecule determines 
^ ^ which these four orbits are formed. In carbon, since the 

- .energy of the 25 is nearly equal to that of the 2p-orbits, we may form 
compound,, or hybrid, orbitals by taking linear combinations of the 25 , 
wave functionfl. Denote the latter wave functions by ^( 25 ), 

and assume they are orthogonal and normalized. From 
them form* four new wave functions 0^ (r = 1,2, 3, 4) defined by the 

= ®r0(25)+6,.0(2pa.)+c,.0(2py)+dy0(2p^) (69) 

== 1, 2, 3, 4), where the a’s, 6’s, c’s, and d’s are constant. The energy 
tif the. orbits corresponding to these wave functions is that of the nearly 
equ^^l 2b and 2p states. If we denote the direction cosines of a certafn 
direction by «= then the wave function 


. 0(2jp<) = k 0(2i)*)+% 0(2pJ (70) 

1$ of the typ4 0(2p) and has its maxima along the directions (Z^, nf) 
^ and With this notation we may write equations (69) 

iii the form 


-“.®<0{2^)+&<0(2p<) ‘ 
t A. F, Welb, Stjyctural Inorganic Chemistrg (Oxford, 1944). 


(71) 
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{i= 1, 2, 3, 4), and note that the maxima of this function are also along 
the direction r^. For the wave functions (71) we have 

J tjiiilijdT = a^aj+bibj j ^(2^),)0(2py)dT 

and J il>(2pi)iji{2pj) dr = = cos fly, 


SO that the orthogonality and normalization relations yield the ten 
equations 


(i,j = 1,2, 3,4) for the determination of the eight constants 6^. 
In this equation 6^^ denotes the angle between the directions and r^; 
cos 9^^ is taken to be unity. 


Case (i). Tetrahedral bonds. If we take the directions r^, fg, fg, 
to be normal to the faces of a regular tetrahedron then, for i j, 

cos^^^ = 

and equations (72) reduce to 

a2+62 = 1, a2-J62 = 0, 

where a ~ a^ — a^ — a^=^ a^^h — — — h^. We thus have 

= i ^ = K3, 

so that the wave functions are 

('^3) 

If we take as axes the lines joining the mid-points of the edges of the 
regular tetrahedron the vertices have coordinates of the type (1, 1, 1), 
(1,— 1,— 1), (— 1, 1,— 1), (— 1,— 1, 1), and the wave functions (73) 
become 

Functions of this type are called tetrahedral orbitals; in each of them the 
electron is concentrated in a particular direction about which the wavi 
pattern is symmetrical. The wave functions of this type have a ftum 
resembling that of Fig. 40. 

If, when it is ready to form a molecule, the carbon atom has its four 
unpaired electrons each in one of these tetrahedral orbitals, the principle 
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of maximum overlapping shows that it can form four equivalent bonds 
directed to the four corners of a regular tetrahedron whose centroid 
coincides with the carbon nucleus. The four bonds thus make angles 
of 109° 28' with each other. In this way we can account for the charac- 
teristic tetravalency of carbon — cf. the methane molecule CH4 (Fig. 41). 
When the four radicals which are bonded to the carbon atom are not 



Fig. 40. Single tetrahedral 
orbital. 



Fig. 41. The methane 
molecule. 


all identical, it can be shown that their mutual repulsions and different 
electron affinities do not affect the tetrahedral angles by more than two 
or three degrees. The tetrahedral bond can also be shown to be very 
stable. 

Case (ii). Trigonal bonds. If we take the directions r^, Fg, Fg to lie 
in a plane making angles 27r/3 with each other and take F 4 to be normal 
to this plane, then 

cos ” COS ^23 — COS ^gj — • 

COS 014 = COS 024 = COS0g4 = 0, 

and so from equations (72) we have 

tti = ttg = Us = 1/V3, 

6l = 62 “ ^8 “ 

(^4 = 64 = 0. 

Choosing F4 to lie along the z-axis and Fj to lie alqng the a;-axis we then 
= {^{2a)+V2^(2i>*)}/V3 
0„ = {V20(2«)-0(2i)J+V30(2p,)}/V6 
0„i = {V20(2«)-0(2pJ-V30(2i)„)}/V6 
0 IV = 0(2r*)- 

These orbitals are called trigonal orbitals. They resemble the tetragonal 
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orbitals in the sense that the first three (which are shown in Fig. 42) 


possess a marked directional property. The 
principal directions lie in one plane and are 
separated by angles of 120 °; the fourth orbital 
is an undisturbed 'dumb-bell’ with its axis 
normal to the plane containing the other 
three. There is one electron in each of the 
trigonal orbits when a carbon atom is about 
to form an unsaturated molecule, such as 



ethylene (C 2 H 4 ), in which the bond angles 
are 120 °. 


Fig. 42. The three 00 -planar 
trigonal orbitals. 


35.3. The ethylene double-streamers 

As an illustration of the occurrence of trigonal wave functions we 
shall now consider the structure of an unsaturated molecule, ethylene: 


H H 



in which there is a double bond. The two carbon atoms are in the 
trigonal state so that we can form five ‘sausage’ bonds each of two 
electrons, such that there are three bonds from each carbon atom. 
Normal to the plane of these orbitals there is a dumb-bell orbital 
corresponding to a mobile electron which is unpaired and not engaged 
in the formation of a bond as yet. The criterion of maximum over- 
lapping demands that the bonds in the CHg plane are at angles of 120 ° 
with each other ; the best we can do with the mobile electrons from 
the two carbon atoms is to pair these together somehow. If the two 
CHg planes of the molecule coincide the two dumb-bells are in parallel 
directions so that they overlap most in this direction. These 
electrons will then form themselves into molecular orbitals of the type 
shown in Fig. 44, in which the probability function has the form of 
two streamers one on each side of the C 2 H 4 plane. For such a molecular 
orbital the wave function will be approximately 

Both electrons have, of course, the same ‘double-streamer’ wave func- 
tion; it should be noted that it is not a case of the upper streamer 
representing one electron and the lower streamer the other. The two 
streamers are inseparable, constituting the additional factor which 
, converts a single bond into a double bond. 
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A characteristic feature of the double bond is its high reactivity. 
Since the streamer electrons are not so tightly bound as the other 
electrons — ^the atomic orbitals from which they are built do not over- 
lap as much — it is easier to form new molecules by disengaging them 
from one another and linking them with other atoms. 



Fig. 43. Overlapping of atomic 
orbitals in ethylene. 


W C+Z) 

► •A 

6 

Fig. 44. Mobile electrons in ethylene. 

Similarly, the triple bond is formed by the addition of a further pair 
of streamer electrons in a plane normal to that of the other pair of 
streamers. 

35.4. I'he benzene molectUe (CgHe) 

A molecule of some importance in organic chemistry is the benzene 
molecule and it is one of the classic problems of cjiemistry to determine 
the distribution of electrons in this molecule. Now a number of sources 
indicate' that all the carbon and hydrogen atoms lie in one plane. We 
see then that for this to be possible each of the carbon atoms must be 
in a trigonal state. The atomic orbitals (Fig. 45) then form twelve 
* sausage’ bonds and the molecule forms a plane ring containing six 
carbon atoms. There are then six mobile electrons left unpaired; their 
orbitals are dumb-bells whose axes pass through the carbon nuclei and 
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are normal to the hexagon formed by the carbon nuclei (Fig. 46)^ 
These electrons will, however, interact and will be free to move about 
from one carbon nucleus to another, forming molecular orbitals of the 
type shown in Fig. 46. The streamer bonds which we encountered in 
ethylene are now spread out over a wider space — ^we have two streamers 
going right round the molecule one on either side of the carbon hexagon. 



Fig. 46. Overlapping of atomic 
orbitals in benzene. 


Fig. 46. Mobile electrons 
in benzene. 


If we write dumb-bell orbitals the molecular 

orbitals streaming round the molecule will, by an analogy with the case 
of Hg, be of the form e 

r=l 

The constants Cj. have to be chosen (by a minimum energy method) 
so that this function is as good a solution of the wave equation as 
possible. It can be shown that there are six possible molecular orbitals 
of the required character each with different values for the and each 
having a different value for the energy. If Eq is the energy of an isolated 
2pg, atomic orbit and jS is a certain constant, of negative value, whose 
magnitude is known from comparison measurements on ethane and 
ethylene, these molecular energies are 

Eq—2P, 

The energies of the first three orbitals are thus less than that of an 
isolated 2pg atomic orbit; these levels are called hording orbits. There 
will be two electrons in each of these bonding orbits in the normal state 
of benzene so that their energy is QE^+Sp, If we had supposed that 
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the molecule had a configuration 


in which single and double 


bonds alternate, the energy would have been 6j^Q+ 6j3. The extra energy 
2j8 gained by removing the restriction to localization has been called the 
reaomnce energy. The value 2j8 is about 30 k.cals. which agrees quite 
well with the experimental value (37 k.cals.). 

Similar calculations to those we have outhned briefiy for benzene 
have been carried out for other molecules of this type and have been 
found to explain many of the well-known facts of organic chemistry. 
For example, it has been shown that the characteristic colour of many 
of the organic dyes may be attributed to the excitation of a mobile 
electron from a bonding orbit to an unoccupied orbit with energy 
exceeding Eq. The results of calculations of the resonance energies of 
a few unsaturated molecules are embodied in the table below, f It will 
be observed that the agreement with the experimental values for the 
resonance energies is reasonably good when we consider the nature of 
the approximations involved. 


Table 6 


Molecule 

Resonance energy {k.cals.) 

Calculated 

Experimental 

Butadiene 

6 

5 

Hexatriene 

11 


Octatetraene 

17 


Benzene 

30 

37 

Diphenyl 

66 

87 

Naphthalene 

63 

76 


f C. A. Coulson, op. cit., p. 130. 
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36. Cohesive forces in ionic crystals 


The theory of the cohesive forces in ionic crystals developed by Max 
Bornt and his collaborators was one of the first successful applications 
of atomic physics to the theory of solids. Though not strictly an applica- 
tion of quantum mechanics, it forms the basis of subsequent develop- 
ments, and it will be necessary to give an account of it here. 

The aim of Bom’s work is to account for the lattice energy, lattice 


constant, crystal structure, and elastic properties 
of ionic crystals in terms of an atomic model. 
The theory has been applied most successfully to 
alkali halides, and will be explained here with 
reference to them. Alkali halides have the simple 
cubic structure; this is shown in Fig. 47. 

The theory worked out by Bom rests on the 



following assumptions: 

(a) The crystal is built up of positively and 


• No 

oa 


negatively charged ions, carrying charges ie re- Fig. 47. Sodium chloride 
spectively. Ions of unlike sign therefore attract structure, 

each other and ions of like sign repel, the force in each case being equal 


to e^/r^. 

(b) Any two ions repel one another with an additional force that is 
practically equal to zero until their separations fall below a certain 
value, when the repulsion increases rapidly. We shall denote the 
potential energy due to this repulsion by w(r) and call it the overlap 
energy, because it only comes into play when the electronic clouds of 
the ions overlap. The total potential energy of a pair of ions of unlike 


sign is thus 


w(r)—e^jry 


where r is the distance between them. 

The form of the function w(r) could in principle be calculated for 
any given pair of ions by the methods of quantum mechanics, but this 
has not, actually been done (cf. Chap. VII); in the theory as developed 

t For a survey see the following: N. F. Mott and R. W. Gurney, Electronic Processes 
in Ionic Crystals (Oxford, 1940); F. Seitz, Modem Theory of Solids (New York, 1042); 
M. Bom, article in the Bandbuch der Physik, 24/2 (Berlin 193S). 
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at present it has been usual to take an empirical form for the inter- 
action energy w{r), either 

w(r) = Ar“* with 5-9 (1) 

or w{r) = (2) 

a form which is suggested by quantum mechanical calculations of the 
force between rare gas atoms.f 

The ions adhere to form a stable crystal because of the electrostatic 
attraction between unlike ions, which outweighs the repulsion between 
like ions. They keep apart because, when two ions overlap, the strong 
repulsive force with energy w(r) comes into play. On this basis we have 
to calculate the energy required to separate the crystal into its 
constituent ions. This may be done as follows: 

Let <l> denote the electrostatic potential at any lattice point of the 
crystal due to all the ions except the one occupying that point. This 
quantity has been worked out for a number of structures; for the rock- 
salt structure at a point normally occupied by a positive ion it is 

<t> = -eajalr, 

where r is the shortest distance between ions of unlike sign and ocj^j is 
a numerical constant known as the Madelung constant. For the rock- 
(^alt (simple cubic) structure it has the value 

= L7476.... 

The electrostatic energy per ion pair is thus 

-ot^eVr. 


In calculating the energy due to the overlap forces, the simplest 
assumption is that the forces fall off so rapidly with distance that only 
ions which are nearest neighbours need be taken into account. Since 
each ion has six nearest neighbours, the overlap energy per ion pair 

Mr), 

and the total energy per ion pair is thus 


J7(r)= _?!2M+6w(f). ' (3) 

r 

To determine the two unknown constants in the function w{r) we 
make use of experimental values of the lattice parameter and the com- 
pressibility (;(). The condition that the crystal is in equilibrium under 
zero pressure, namely, dU(r)ldr = 0, '(4) 


t Cf. § 33.3. 
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gives one relation between these quantities, and for the compressibility 
a short calculation gives 


1 ^ 1 d^U(T) 
X 18r dr^ 


( 5 ) 


If we use the expression (1) for w(r)f equations (4) and (6) give 

— JL. 

6A« 


and 


18 r^ 


The second of these equations gives s and the first A, all in terms of 
measurable quantities. Estimates of the repulsive energy w[r) can thus 
be made. 

The theory has been refined by including in the energy the van der 
Waals attraction between the ions, the repulsion between ions which 
are not nearest neighbours, the zero-point vibrational energy, and so 
on. By comparison of theoretical formulae with observed lattice 
constants and compressibilities, values of w(t) have been deduced for 
the interaction energy of halide ions with alkali, silver, thallium, and 
cuprous ions.f 

For the alkali-halides a particularly convenient form of interaction 
energy is given by Mayer; this is (in ergs) 

w(r) = 4 X 


where p = 0-346 x 10“® cm. and fj, are ionic radii shown in the table. 


Table 7. Basic radii and r^X 10® cm. 


Li+ . 


. 0-476 

F- . 

. 1-110 

Na+ 


. 0-876 

Cl- . 

. 1-476 

k:+ . 

, 

. 1-186 

Br-. 

. 1-600 

Rb+. 
Cfl+ . 

. 

. 1-320 
. 1-466 

1“ . 

. 1-786 


The energy per ion pair required to separate the crystal into its 
constituent ions is, on the simple treatment given above, easily seen 
to be given by 






H 


(6) 


This has a value of the order 6-6 eV. per ion pair, or 120 k.cals. As 
5/^9, the greater part of (6) is due to the electrostatic forces. |?7| is 


t For references cf. Mott and Gurney, loc. cit., chap. i. 
% J. E. Mayer, J. Chem. Phye. 1 , 270 (1933). 
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connected with the energy Uj^ per ion pair necessary to dissociate the 
crystal into neutral atoms by the formula 

\U\ ^ 

where I is the ionization potential of the metal atom and E the electron 
affinity of halogen ion. From calculated values of U (deduced from 
formula (6)), using known values of r, x and experimental values of 
V^, one can thus deduce theoretical values of the electron affinities 
E, which are not in general known in any other way. The success of 
the theory lies in the agreement between the values of E deduced from 
different compounds; this is shown below. 


Table 8. Electron affinities of the halogens in k.cals. 


Metal with which 
combined 

F 

Cl 

Br 

I 

Li ... . 

9,50 

86-7 

81-2 

75*8 

Na . . . . 

96-6 

86-6 

80-9 

73-9 

K . . . . 

95*6 

87- 1 

81-2 

73-2 

Rb . . . . 

96-8 

86*7 

82-0 

73-8 

Cs . . . . 

93-8 

87-3 

82*0 

74-2 

(Sutton and Mayerf ) 




72-4 


It seems that the theory built up along these lines gives a satisfactory 
account of cohesion in alkali and silver halides. For oxides and sul- 
phides and in general for crystals where the ions carry two or more 
charges, no recent comparison between theory and experiment has been 
made.t The difficulties in doing so are due to the following facts: 

(a) The van der Waals forces between oxide ions are large and 
difficult to estimate. 

(d) It is possible that some ‘homopolar’ forces exist between the ions 
as well as the electrostatic attraction; in other words, the over- 
lap force is not wholly repulsive in character. 

(c) The oxide ion 0“" is not stable in free space. 

37. Crystals in thermal equilibrium 
37.1. Lattice vibrations 

In a crystal in thermal equilibrium at a temperature T, we must 
<x)nsider three types of deviation from the perfect lattice: 

(а) The lattice vibrations of the atoms. 

(б) The existence of vacant lattice points. 

(c) The existence of interstitial atoms or ions. 

t Sutton and J. E. Mayer, J. Chem. Phys. 3, 20 (1935). 

X See an early attempt by J. Sherman, Chem, Rev. 11, 93 (1932). • 
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The theory of the lattice vibrations will not be considered in detail 
here.f We may remind the reader of the rather crude (Einstein) model, 
in which each atom of a monatomic solid is supposed to vibrate inde- 
pendently of all the other atoms with frequency v; the mean energy 
of each atom is thus / ^ x 

The temperature hvjk is known as the characteristic temperature and 
denoted by 0. For temperatures large in comparison with 0, (7) reduces 
to SAjT, giving a specific heat of where N is the number of atoms 
per gramme. If is the mean amplitude of the thermal vibrations 

^ = ¥T/MkG^ ( 8 ) 

where M is the mass of an atom. 

In an exact theory of the lattice vibrations and of the contribution 
of the specific heat due to them 4 it is necessary to analyse the vibra- 
tions of the lattice into its normal modes, and denote by n(v}dv the 
number of normal modes with frequency between v and v-\-dv; the total 
number of modes for a solid of N atoms is 3iV^, so that 

J n(v) dv — ^N. 

The total integral energy is thus 

j jdv. 

The calculation of n(v) for any real solid is a very difficult problem. 
Debye showed that for small v 

n(j/) == const, 

and showed how to calculate the constant in terms of the elastic 
constants of the solid.§ 

37.2. Vacant lattice points and interstitial ions 
In addition to the thermal vibrations, a crystal in thermodynamical 
equilibrium differs from a perfect crystal through the presence of 

(а) Interstitial atoms or ions. 

(б) Vacant lattice points. 

The case of a monatomic solid (metal) will be treated first. 

It will be possible for atoms from the surface layer to diffuse into 
the body of the crystal, where they will be found in the so-called 

t See, for example, R. H. Fowler, Statistical Mechanics^ 2nd ed. (Cambridge, 1937) or 
P. Seitz, loc. cit. 

X For most materials this is much the most important contribution. 

§ For recent calculations of n{p) cf, M. Blackman, Proc. Roy, Soc. A, 148, 384 (1935), 
159, 46 (1987); Proc. Camb. Phil. Soc. 33, 94 (1937). Also see Seitz, loc. cit., p. 103. 
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‘interstitial positions’, as illustrated in Fig. 48a. We may denote by 
Wi the energy necessary for the formation of an interstitial atom. 
Wi will tend to be large for close-packed structures, for which the space 
available in an interstitial position is small and the distortion of the 
surroundings necessary to make room for an interstitial atom large. 



(a) (b) 

Fig. 48. Showing formation of interstitial ions (a) and vacant lattice points (&). 


In any case an elementary calculationf gives for the number of 
interstitial ions per unit volume 


where is the number of interstitial positions per unit volume. 

In the same way, vacant lattice points (holes) can be formed at the 
surface by diffusion inwards, as shown in Fig. 486. Let Wf^ be the 
energy necessary to form a hole ; then in equilibrium the concentration 
of holes is ^ 


where N is the number of lattice points per unit volume. 

For close-packed metals it is probable that is less than TfJ, so that 
the holes will greatly outnumber the interstitial ions; for copper, theo- 
retical calculations! indicate that this is in fact the case. According 
to these workers, self-diffusion in metals takes place through the motion 
of holes. It is suggested that an atom will not move from one lattice 
point to another unless there is a hole next to it.§ 

In ionic crystals, in principle it is possible to have vacant lattice 
points and interstitial ions of both signs; in general, however, the energy 
required to put the large negative ion into an mtefstitial position is 

t Tbe ni^paJber of ways that the n^ ions can be distributed 4 
and the free energy is therefore 

n^W^-^kT log P, 

Mahing this a minimum with respect to charges in n^, and remembering that ^ N^t 
the given equation is obtained. , 

t H. Huntingdon and F. Seitz, P/iys. Rev. 61, 315 (1942). ‘ ^ 

§ For a discussion of the mechanism, of. K P. Johnson, Phya. Rev. 56, 814 (1939); 
Q. Wyllie, JVoe. PAya. <Sfoc. 59, 694 (1947). 
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SO lai^e tiiat the concentration is neghgible. Also any large volume 
of the crystal must be electrically neutral; otherwise large fields will 
be set up by the space charge. 

It is thus convenient to consider two types of defect, both of which 
are electrically neutral. 

(a) An interstitial ion (usually positive) and some distance from it 
a vacant lattice point from which the ion may be supposed to have 
come. This is called a Frenkel defect.f It will be noted that a Frenkel 
defect can be formed in the body of the crystal. If Wf is the energy 
necessary, the concentration of Frenkel defects is given by J 

n = 

(b) Two vacant lattice points, one of each sign, wandering about 

independently of each other. This is called a Schottky defect. If Wg is 
the energy necessary to form a Schottky defect, the concentration 
will be ^ _ ]^^-wjkT^ 

The formulae are valid only if WJ, WJ differ considerably, so that only 
one type of defect is actually formed in appreciable quantities. 

It is beheved that Schottky defects are formed in alkali halides some- 
what above room temperature, § while Frenkel defects are formed at 
a concentration of about 10’^® per atom in silver hahdes at room 
temperature. 

The ionic conductivity that all solid ionic crystals show at high 
enough temperatures is probably due to defects of one or other of these 
types, and either to the motion of interstitial ions, or to the motion 
of holes, or both. 

A direct proof of the importance of holes (vacant lattice points) is 
given by the well-known experiments of Koch and Wagner;|| crystals 
of silver halide were prepared containing small quantities of CdCl 2 or 
CdBrj in solid solution. In these crystals each Cd++ ion replaces two 
Ag+ ions in the lattice; the crystal must thus contain at least as many 
holes as there are Cd++ ions. The number of holes is thus greatly 
increased and the conductivity, especially at comparatively low tem- 
peratures, is greatly increased also. 

38. Electronic motion in crystals 

For the understanding of the motion of electrons in both metals and 
non-metals it is essential to study the motion of an electron in a field 

, t X Fienkel, Zeita.f. Phyeik, 35, 652 (1926). 

^ ^ Mott aad Gumoy, loc. cit., chap, ii: 

. $ F..SeitK, Nev. Mod, Phya. 18, .384 (1946). 

J[ E. Koeh and C. Wagner, Zeiia.f. phya. Chem, B, 38, 296 (1937). 
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which is periodic in space with the period of a crystal lattice. The 
necessity for this may be seen most easily by considering a crystal such 
as sodium chloride into which an extra electron has been introduced. 
(In practice this can be done by photo-electric excitation, cf. Mott and 
Gurney, loc. cit., Chap. IV.) One can imagine the electron located on 
one of the metal ions and thus forming a metal atom. But if the 
electron transfers its position to the next atom, there is no change of 
energy; and the potential barrier between the two positions can easily 
be penetrated by tunnel effect. The electron is thus free to move 
through the crystal. The wave functions describing its motion will be 
wave packets or plane waves, like those describing the motion of an 
electron in free space, and can extend right through the crystal. 

Similar wave functions are used to describe the free electrons in 
metals. Their use depends on the assumption that, so far as its effect 
on a given electron is concerned, the field of all the other electrons can 
be averaged in the sense of the Hartree approximation (Chaps. V, VI). 
This will be discussed further in the section devoted to metals. 

In either case, therefore, it is necessary to study the motion of an 
electron in a field which is periodic with the period of the crystal lattice. 

38.1. Motion in a periodic field 

In order to understand the properties of an electron in a periodic 
field we shall consider first the one-dimensional case; we consider, then, 
the solutions of the Schrodinger equation 

(9) 

where F is a periodic function of x with period a. 

For a solution ^ to represent a possible state of the electron in an 
infinite metal, \fj must be bounded for all x. We shall show that the values 
of the energy W for which such solutions are possible fall into a series 
of bands, as shown in Fig. 49. All such solutions can be written in the 

e^^^u(x) (—nja <k < nja), ‘ 
where u{x) is periodic with period a. ^ 

For e^h allowed band of energy values, h varies continuously from 
— Tr/a to Trja, so that a series of allowed energy bands can be defined, 
Wi{k), Bi(^), and so on. 

Various proofs have been given, and we shall refer to one due to 
Kramers, j* Kramers first points out that, if ^^(a!) and S'^e inde- 

t H. A. Kramers, Physica, 2, 483 (1935); see also F.» Seitz, loc. oit., p. 278. 
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pendent solutions of the Schrodinger equation, then since ijiiix+a) is 
also a solution, it follows that 

It is then easy to show that two linear combinations of may be 
chosen, <f> 2 , with the property 

^i(x+a) = Xi<t>i(x), 

<l>t(x+a) = Xiiiix). 

From these equations we have, differentiat- 
ing each equation and multiplying by the 
other, 

(01^2~^1^2)a;+a = 

( 10 ) 

For any Schrodinger equation, however, if 
<^i, <j )2 are any two independent solutions 
corresponding to the same energy, 

4’i^i-^i<l>2 = 0 . 

so that ^ 1 ^ 2 ”*^! ^2 const. 

It follows from (10) that A^Ag = 1. 

Now if j/r is bounded, it is clear that both Ai and Ag must be real or 
imaginary quantities with modulus unity. For if this were not the case, 
then 1^1 would increase either with increasing or decreasing x, and 
would not be a bounded function. Thus we may write 

Ag = e-^K 

It follows that the two solutions (^g have the forms 

where Ujf.(x) is periodic in x with period a. 

It follows that the lowest state* of each band of energy values is 
periodic, and that the higher states consist of plane waves modulated 
by the periodic field. f 

Wigner and SeitzJ have given the following method for determining 
the wave fimction in the lowest state in one of the energy bands in 
close-packed metals, which is of particular importance. They point 
out that if planes are drawn bisecting the lines which join nearest 
neighbours in the crystal, a polyhedron, is formed surrounding each 

t For an evaluation of the wave functions in a particularly simple type of periodic 
field, of. R. Kronig and W. G. Penney, .Proc. Roy. Soc. A, 130, 499 (1931). 

X E. Wigner and F. Seitz, Phya, Rev. 44 , 804 (1933) ; of. also Mott and Jones, loc. oit., 
chap. iv. 

6071 p 



Fig. 49. Allowed values 
of the energy TT as a func- 
tion of wave-number k. 
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atom. Supposing we now look for a periodic wave-function u(x,y,z) 
which has the symmetry of an 8 wave function (spherical symmetry) 
in the neighbourhood of each atom. Then clearly dujdn must vanish 
on the boundary of the polyhedron. In ortler to calculate u, the poly- 



Fio. 60. Wave functions for metallic sodium, copper, and silver. 

hedron Js replaced by a sphere of equal volume, radius rg. Then the 
Schrodinger equation 

has to be integrated subject to the boundary condition 
du ^ r, * 

-3- = 0 for r = ro- 
ar 

It is clear that a whole series of energy values will exist, for which 
u vanishes .On 0, 1, 2,... spheres round each nucleus. For sodium with 
configuration (l8)*(28)*(2p)®(35)^, the states with respectively zero and 
one node will correspond to the X-ray X and L levels, and the wave 
functions will not differ much from those for the free atoms. The case 
corresponding to the valence electrons of various metals is shown in 
Fig. 50. For sodium states with more than two nodes correspond to 
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high 'positive energies W. Each of these wave functions and energy 
values corresponds to the lowest state of a band of energies. These 
bands will be referred to as s-bands or ^-zones. 

The variation of energy with k within an .9-band can be investigated 
by means of an approximate method known in the literature as ‘the 
method of tight binding*. It will be seen that the wave-function 
can be written in the form 

(12) 

where <^k(r) is some function with the requisite number of zeros tending 
to zero at a few atomic radii from r ~ 0, and the summation is over all 
lattice points F/. The wave function of Wigner~8eitz type illustrated 
in Fig. 50 could, for instance, be so represented. Let us assume this 
done; and make the following approximations: 

(a) <j) is independent of k; 

(b) overlap is negligible between the wave-functions <^(r—r^), 
^(r— r^.) unless the lattice-points r^, are nearestmeighbours. 

Then, denoting by H the Hamiltonian operator, 

V./m 


the energy Mi of the state k will be given by 


Wt. = H<l>„ d^jj dr. 

With the approximations given above, this reduces to 


( 13 ) 


Here pj is the vector joining an atom to its nearest neighbours, and 
the summation is over all the neighbours of any one atom; also 


A = j^*H<l>dr, 

A' = dr, 

5 = J^*(r)H^(r-p)dT, 


H' = J^*(r)^r-p)dr. 

For a simple cubic lattice of lattice constant a for example, 
= 2{(iOHk^a+ooskya+(iOskga}. 
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From this formula, we see that all values of k within the cube 
—■IT fa <,k^ < rrja 
—Tr/a <ky < Trja 
—TTja <% < Tr/a 

correspond to independent wave functions with energies in the 5-band. 
The corresponding cube in Z;-space (Fig. 61) is called a ‘Brillouin zone'. 


A:x 


(b) 

Fig. 61. Zone in A:-spacefor «-statesin (a) a simple cubic lattice, and (b) in a 
hexagonal close-packed lattice (two dimensicms). 

It is clear that for a crystal of JV atoms the number of electronic 
states in the zone is 2.^, since two spin directions are allowed for each 
wave-function This is the case for any crystal structure. For the 
simple cubic it is instructive to verify this result by quantizing k by 
laying down the condition that ijj must be periodic in x, y, z with period 
Tia where n is a large integer. This is a conventional quantizing condi- 
tion; it would be appropriate for a wire bent round into a circle, for 
a coordinate measured along the circumference of the wire. We deduce 
from this condition that 

= 27m7na {n' = —in,..,, 0,..., +in), 
which gives the required number of values. 

To find the limits of the zone for structures 
other than simple cubic, we must draw in k- 
space the lines for which first-order Bragg re- 
flections occur. This will enclose the required 
zone. This may be seen as follows: Fig. 52 shows 
a series of crystal planes, and it is supposed that 
a wave-vector k in the direction shown is re- 
flected by them. Then, by the Bragg condition 
for reflection, in the wave function (12) the sign 
of the coefficient of the wave function wiU change in going from one 
atom to the next. It is clear thkt this represents the extreme form of the 
wave function, in the sense that a larger numerical value of k in the 
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same direction would lead to a repetition of wave functions already 
described by smaller values. 

We shall illustrate this principle by considering the s Brillouin zone 
for close-packed atoms in a plane (Fig. 61 6). There are then three sets 
of planes, at angles of 120° to each other, and spaced V3a/2 apart. 
The first Brillouin zone is thus a hexagon of side 1/V3a and volume 
V3/2a2. Since the number of atoms per unit area is V3/2a^, it will be 
seen that we have in the zone a number of states equal to the number 
of atoms. 

For 2 we obtain 

2 = 2 cos 4 cos a cos | V3 ky a. 

It will be seen from formulae (13) that near the bottom of any zone 
W — const.-f ak^. 

We may write a = ^2/2m*; 

m* will then be called the ‘effective mass’ of an electron moving in 
the crystal lattice. For energies near the bottom of a band, an electron 
moving in a periodic lattice will behave exactly as a particle in free 
spacef with charge e and mass m*. The approximation is often made 
of replacing m* by m. 



Fig. 63. p wave function of lowest energy for metallic 
lithium ; the dots represent adjacent atoms. 

In addition to the 5-zones already described, p- or d-zones can be 
formed by a similar process. They will not be discussed in any detail 
here. Fig. 53 shows the general appearance of the lowest p wave 
function for lithium. The symmetry of such a wave function is such 
that it will be described by k^ = w/a rather than k = 0, 

Although the energy corresponding to this wave function will in 
general be higher than that of the state (k^ = Tr/a, ky — kg = 0)m the 
5-zone, it wiU not necessarily be higher than all states in the 5-zone. 
Thus the energy band of the 5-zone may overlap the energy band of 
the p-zone. If one plots against energy W the number N(W)dW of 
states in the energy range dW one gets a curve such as that of Fig. 64a. 

t It is assumed that the lattice is rigid, i.e. that the atoms are not vibrating. Under 
these conditions the mean free path is infinite. In practice an electron will move with a 
finite free path, as shown in § 40.7. 
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39. Conductors and insulators 

In metals and also non-metals such as diamond one can treat each 
electron as moving in the field of the positive ions and the field of all 
the other electrons, averaged in the sense of the Hartree or Fock 
approximation (Chaps. V, VI). This field is, of course, periodic, so that 
the wave functions are of the type described in the last section. 
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Fig. 64. Density of states in solids; N{ W) dW gives the mimber 
of states with energies between W and W-{-dW, (a) Monovalent 
metal ; (6) divalent metal ; (c) insulator ; {d) graphite. 


Only two electrons are allowed to be in each state specified by a 
quantized value of k. We have seen that each 5-zone contains 2^ 
states. Therefore, for monovalent metals (sodium, silver, etc.), the first 
zone will be half full. This is shown in Fig. 54, where curves plotting 
N(W) against W are shown. 

If a zone is half full, it is clearly possible for the solid to carry a 
current, because within the zone a state can exist in which there are 
more electrons moving in one direction than the other. It is clear, 
therefore, why atoms such as Cu, Ag, Au, Na, K, etc., form metallic, 
conducting lattices. 

On the other hand, if a zone is fully occupied, the electrons of the 
zone cannot contribute to the current because exactly as many are in 
states with wave- vector k in one direction as in t^e other. Insulators, 
therefore, are materials in which each zone is either full or empty 
(Fig. 54 c)^ An important intermediate case (graphite)t sketched in 
Fig. 54 d. 

The fact that atoms with two electrons in the outer ring form con- 
ducting lattices (except helium) shows that for these materials the 5 -zone 
must overlap the jj-zone, as shown in Fig. 546. 

t P* Walkoe, Phy8. Rev. 71, 622 (1947). 
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It must be emphasized that the model used here, in which each 
electron is supposed to move in the averaged field of all the other 
electrons, is of course an approximation. What is assumed is that, 
if an antisymmetrical determinant of the type of Chapter VI, equation 
(11) is set up, in which the orbital wave functions are of the form 
then this will be a useful approximation of zero order to the 
true wave function. It is very difficult to give convincing reasons why 
this should be the case for metals. 

For further discussions of the validity of the collective electron 
model, and for attempts to improve it, cf. papers by Schubin and 
Wonsowski,t and by Bethe.J 


40. Applications of the collective electron model to metals 

In this section an outline only can be given of some of the applica- 
tions of the collective electron method to the properties of metals: 


40.1. Emission and absorption of soft X-rays 
If any metal is used as an anticathode in an X-ray tube, part of the 
emitted radiation will correspond to transitions from the conduction 
band (the occupied states) into a lower X-ray level. If the mean energy 



hv of the emitted radiation is not too great compared with the width 
of the band of energies of the occupied states, the form of the emitted 
band can be determined experimentally. Fig. 65 shows the form of 
the Lg band for sodium and magne8ium.§ From the theoretical point 
of view, we expect the emitted intensity to be proportional to 

N(W)p(W), (14) 

where p(W) is the transition probability from a state of energy W into 
the X-ray level. The form of the band shown in Fig. 55 b has the 
characteristics that we expect for a divalent metal, namely two over- 
lapping zones and a sharp cut-off for high energies. 

t S. Schubin and S. Wonaowski, Proc. Bay. Soc, A, 145, 169 (1934). 

X H. Bethe, Zeits.f. Physik, 71, 206 (1931). 

§ Prom H. W. B. Skinner, Rep. Progress in Physics^ 5, 271 (1938); see also H, M; 
O’Bryan and H. W. B. Skinner, Proc, Roy. Soc, A, 176, 229 (1940). 
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In the same way the absorption intensity may be expected to be 
given by (14), where N(W) refers to the empty levels. 

For a review of the emission and absorption of soft X-rays, and their 
interpretation in terms of theory, cf. Skinner, loc. cit. 


40.2, Specific heats 

The electrons in a metal are subject to the Fermi— Dirac distribution 
law. According to this law, the probability that at temperature T a 
state with energy W is occupied is 




(16) 


It means that if W is much less than i.e. if IT > kT, the probability 
is practically unity; while, if W — { ^ kT^ the probability is approxi- 
mately 

C,AW-l)lkT^ 

The parameter I is given by the equation 


J N(W)f(W) dW = total number of electrons. 

0 

5 differs only slightly from the energy of the highest occupied state at 
the absolute zero. f{W) is illustrated in Fig. 66. 



Fig. 66. Fermi distribution law for electrons in a metal. 


The energy of the electrons at temperature T is 
/ N(W)i(W)WdW. 

This differs from the value at absolute zero by a term of order 
N {i)(1cTY, since the number of excited electrons will be of order N{QkT 
and they will each have energy of order kT. The contribution of the 
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electrons to the specific heat is thus linear in the temperature. A calcu- 
lation of its magnitude givesf 

If the calculations refer to a gramme-atom, N{^) is of order 

where N is the number of free electrons per gramme-atom. Assuming 
this to be of the same order as the number of atoms, we see that the 
specific heat is of order 

ZNh 

Since £ is equal to several electron volts, the electronic specific heat 
at room temperature will be of the order of 1 per cent, of that due to 
the vibrations of the atoms. 

At low temperatures, however, where the contribution of the lattice 
vibrations varies as T®, the contribution of the electrons is the pre- 
dominant term, and at the temperatures of liquid helium has been 
observed for a number of metals. Expressing the linear term per 
gramme-atom as yRT, y has the following values (x 10"^): 

Cut Al§ Ni|| Feft PdJt PtJ 

0-888 1-742 8-742 6-0 16 8-0 

The value for aluminium holds only down to the transition point to 
the superconductive state, 1*13° K. 

The high values for the transition metals will be noted. These are 
associated with a partly filled band formed from the d-states of the free 
atom.§§ 

40.3. Paramagnetiem 

As for the specific heat, one can argue that the number of free 
electrons (free to change their spins without infringing the exclusion 
principle) is kTN(l) and that each of these contributes fi^IkT to the 
susceptibility (/i = ehl2mc). An exact calculation gives for the suscep- 
tibUity 2|n*^(S). 

t Cf. N. F. Mott and H. Jones, Theory of the Properties of Metals and Alloys, p. 178 
(Oxford, 1936). 

X J. A. Eok and W. H. Keesom, Physica, 3, 1036 (1936). 

§ Ibid. 4 , 835 (193T). 

II W. H. Ee6l3om and C. W. Clark, ibid. 2, 513 (1935). 
tt W. H. Keesom and B. Kurrelmeyer, ibid. 6, 364 (1939). 
tt G. L. Pickard, Nature, 138, 123 (1936). 
i§ Mott and Jones, loc. cit., chap. vi. 
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One therefore expects a correlation between the electronic specific heat 
and the susceptibility. As shown above, the paramagnetic metals Pd 
and Pt do in fact have large electronic specific heats; but the values 
of N{^) determined from the specific heats are 2-4 times smaller than 
those from the paramagnetism. It is known that for ferromagnetic 
materials an interaction between the electrons exists which leads them 
to set themselves with their spins parallel; for Pd and Pt a force of 
this type probably exists also, not strong enough to lead to ferro- 
magnetism but enough to increase the paramagnetism. 


40.4. Thermionic emission 

An equation for the current emitted from a metal surface at tem- 
perature T under fields strong enough to give saturation can be derived 
as follows, it being assumed that the effect of Bragg reflection (i.e. of 
the zone structure) of the electrons in the metal can be neglected. 

Within the metal, the number of electrons with moments between 
Pxy in the volume element dx, dy, dz is given by the 

Fermi-Dirac law 

2 dxdydzdpjp^dp, lur_P^\ 

4W-mT - 2mf ’ 

For electrons with energy great enough to leave the metal, this 
becomes « 

— dxdydz dp^ dpy dp^ gii-wmy 
nr 

Let the surface of the metal be the yz-plane. Then we have to con- 
sider only the motion perpendicular to this plane. Since 

00 

J ^-p*l2mkT^p „ ^(27rWl^T), 

— 00 

it follows that the number of electrons per unit area of surface in a 
layer of thickness dx and with p^ in the range dp^ is 

Let i+<lf he the minimum energy that an electron must have in order 
to leave the metal, and let r be the reflection coefficient for an electron 
with momentum Then the number leaving the^ surface per unit 
area per unit time is 

00 

iH 
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On the assumption that r ^ 0 (probable in view of the existence of the 
image force) this gives for the current per cm.* 

where A = ArTmkhjh^ ^120 amp./cm.* 

For a discussion of the value of A for different models for the surface, 
cf. R. H. Fowler, Statistical Mechinics, 2nd ed. (Cambridge, 1936), 
p. 349, and for a discussion of the validity of the formula, cf. F. Seitz, 
Theory of Solids , p. 165. 


zero 



Fig. 57. Showing potential energy of an electron at the boundary of a metal. 

— In the absence of a field. 

With a field. 

In fairly strong fields the work function is reduced; this effect is 
known as Schottkyj* emission. This is because the field on an electron 
outside the metal at a distance x from the surface is that due to the 
image force Outside the surface, then, the potential energy 

of an electron is —e^jix. When the field F is introduced an additional 
term ~eFx is added to the potential energy. The function 

—eFx—e^l^x 

has its maximum at x = i^jielF), and has there the value -e^FK 
Thus the effective work function <(>' is 

f 

This is shown in Fig. 67. 

The current Ijr for a field F should thus satisfy the relation 
T^log/^-log/o) _ ei 

FI k' 

This relation has been shown to be valid over a remarkably wide range 
of temperatures and for fields up to 10® volt/cm. by de Bruyne.J 

40.5. Cold emission 

For very strong fields (of the order 10’ volt/cm.) emission from an 
electrode into a vacuum will take place by tunnel effect throv^h the 

t W. Schottky, Zeits.f. PhysiJe, 18, 63 (1923). 

. J N. A. de Bruyne, Proc, Boy, Soc. A, 120, 423 (1928). 
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potential barrier ABC of Fig. 57. A current is thus obtained which is 
independent of temperature. 

The theory of the effect is as follows. Let us denote by D('p^ the 
probability that an electron with momentum in the a:-direction will 
pass through the barrier. Then the number of electrons leaving unit 
area per unit time is o ^ 


where the integration is over all values of p such that 

< 2m?. 

Setting 2mJ = and writing Pa? = easily seen that this 

gives 4^2 


JiXp- 


0 

where the upper limit of the integral may be taken to be infinity, since 
D decreases rapidly with increasing 6. 

The transparency factor D wiU be of the form Ae-^, where A is a factor 
of the order unity, and j8 may be obtained from the WKB approximation 
(Chap. I, §3.3) as follows: 


j8 = 2/< J V(F- W)dx (s: = 2m/«*) 

(X-()leF 

= 2k J ^(eFJdx (2m^ = pl) 
For values of in the neighbourhood of p we have 



Putting A = 1, this gives for the number of electrons emitted per unit 
area per unit time g 2 j ^2 

27rh<l> 

If the approximation A = 1 is not made, a result is obtained which 
does not differ from this appreciably! if J. 


40.6. Metallic cohesion 

No detailed account will be given in this book of the theory of metallic 
cohesion. In the treatments given up till now the energy of the metal 
has been divided up into the following parts: 

(a) The interaction of the conduction electrons with each other and 
with the metallic ions, and the electrostatic interaction of the 
latter with each other. 

f Of. R. H. Fowler, loo. oit. 
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(6) The repulsion which comes into play when the ions overlap. 

In copper or silver (b) is important; the ions appear to be in contact, 
so that the force between them is the determining factor for the com- 
pressibility; for the alkali metals, on the other hand, the ions do not 
overlap. For metals such as tungsten it may not be possible to make 
a distinction between ions and conduction electrons. 



Fig. 68. Density of electronic states, (a) face -centred, 

(6) body-centred cubic. 

In considering the force (a), it is of importance that the ‘atomic 
sphere’ surrounding each atom is electrically neutral; thus in calculat- 
ing the potential energy of the system one has only to consider the 
interaction of the electrons within each sphere with each other and with 
the ion. This interaction is mainly responsible for the cohesion of the 
metal. 

For metals with more than one electron per atom the kinetic energy 
of the electrons will be much more sensitive to the crystal structure 
than for metals with one electron. This is because of the zone structure 
explained in an earlier section. Fig. 58 shows the density of states. 
N{W)dW is the number of electrons with energies between W and 
W +dW for the conduction electrons in the 5-zone in a metal with body- 
and face-centred cubic structures, and the same atomic volume; the 
calculations are due to H. Jones.t It will be seen that for metals with 
only one electron per atom there is little difference between the energies, 
but for larger numbers the body-centred structure has the lower total 
energy. On the basis of these ideas Jones has shown why in alloys of 
CuZn, CuAl, CuSn, etc., transitions from the face- to the body-centred 
structure and other similar transitions occur at a nearly constant ratio 

t H. Jones, Proc. Phys. Soc. 49, 243 (1937). 
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of free electrons to atoms (Hume-Rothery jnle). For these two parti- 
cular structures it is not difficult to see why this should be so. For the 
face-centred structure reflection occurs first for the (111) set of planes, 
and thus for a wave-number h equal to For the body-centred 

cubic, on the other hand, where the longest wave-length for the Bragg 
reflection is given by the (110) lines, h is V2d/7r. In these formulae d is 
the lattice parameter in either case. If a® is the atomic volume, then 
since the two structures contain 4 and 2 atoms respectively in the unit 
cube, these wave numbers are in the ratio 


V3a/2l and V2a, 

i.e. l*37a and l*41a. 


40.7. Electrical conductivity 

The wave mechanical theory of electrical conduction in metals is 
based on the following deductions from the theory: 

(a) Free electrons will go through a perfect crystal lattice for an 
infinite distance without being scattered. 

(^) Scattering, giving a finite mean free path, can be due to im- 
purities or foreign atoms in solid solution, thermal vibrations, 
crystal boundaries, or any other disturbance to the periodicity 
of the lattice. 

(y) In the scattering process an electron gains or loses an energy of 
the order kT and thus small compared with C (the Fermi energy); 
the only electrons which can make collisions are those with kinetic 
energy approximately equal to 

Calculations of the conductivity due to heat motion at low tem- 
peratures (T < 0, where 0 is the Debye temperature) will not be 
considered here; if T > 0 for pure metals, send also for the resistance 
due to impurities, it is possible to express the conductivity of a metal 
in the following way (Mott and Jones, loc. cit., p. 268 et seq.): 

The conductivity a is given by 


<7 = ne^jm^ / 

where n is "the number of free electrons per unit volume and t, the time 
of relaxation, is given by ^ 


where N is the number of atoms per unit volume, v the Tnn-YiTmim 
velocity of the Fermi distribution, and A an effective collision area 
given by 

^ J {l---cos^)/(0)da). 


A 
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where 1(6) is the cross-section for scattering into the solid angle do) by 
each atomf (in an alloy the mean probability). 

From this formula we can deduce that for T > 0 the resistance of 
a pure metal is proportional to T\ for if x is the displacement of any 
one atom from its mean position, it is clear that the scattered amplitude 
is proportional to a;, and hence that 1(6) is proportional to The 
mean value of x^, however, is proportional to potential energy of all 
the vibrating atoms and thus to half the total heat content. This in its 
turn is proportional to T. 

We may deduce also an instructive formula due to NordheimJ on 
the resistance of alloy systems such as AgAu which form a continuous 
range of solid solutions. Suppose that the two types of atom are 
denoted by A and B and that Vj{r), I^(r) denote the potential energies 
of an electron in an atom of either type. If the number of atoms of 
the two types in the alloy are in the ratio c to 1— c, then the mean 
potential energy in all atoms is 

cy^(r)+(l-c)V^(r), 

In A atoms the potential differs from this by 

{^-cW^-Va) 

and in B atoms by 

The intensities of the waves scattered from atoms of the two types are 
thus proportional to v 2 -2 

[L C) , C , 

and hence the total scattering (resistance) is proportional to 
c(l-c)2+(l-c)c2, 

which reduces to c(I— c). (17) 

Equation (17), therefore, sl^pws the way in which the residual resistance 
of an alloy varies with composition. Good agreement with experiment 
is obtained for alloys such as PdPt, AgAu. 

41. Electronic motion in non-metals 

In this section a summary will be given of the electronic behaviour, 
from the theoretical viewpoint, of ionic and other crystals which are 
insulators at low temperatures and if not exposed to light which they 
absorb. 

Such crystals have a full zone of energy levels separated by a finite 
energy gap from an empty zone (the ‘conduction band’). If an extra 

t Cf. chap. ix. 

t L. Nordheim, Ann, der Phyeik, 9, 007 (1931). 
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of free electrons to atoms (Hume-Rothery jnle). For these two parti- 
cular structures it is not difficult to see why this should be so. For the 
face-centred structure reflection occurs first for the (111) set of planes, 
and thus for a wave-number h equal to For the body-centred 

cubic, on the other hand, where the longest wave-length for the Bragg 
reflection is given by the (110) lines, h is V2d/7r. In these formulae d is 
the lattice parameter in either case. If a® is the atomic volume, then 
since the two structures contain 4 and 2 atoms respectively in the unit 
cube, these wave numbers are in the ratio 


V3a/2l and V2a, 

i.e. l*37a and l*41a. 


40.7. Electrical conductivity 

The wave mechanical theory of electrical conduction in metals is 
based on the following deductions from the theory: 

(a) Free electrons will go through a perfect crystal lattice for an 
infinite distance without being scattered. 

(^) Scattering, giving a finite mean free path, can be due to im- 
purities or foreign atoms in solid solution, thermal vibrations, 
crystal boundaries, or any other disturbance to the periodicity 
of the lattice. 

(y) In the scattering process an electron gains or loses an energy of 
the order kT and thus small compared with C (the Fermi energy); 
the only electrons which can make collisions are those with kinetic 
energy approximately equal to 

Calculations of the conductivity due to heat motion at low tem- 
peratures (T < 0, where 0 is the Debye temperature) will not be 
considered here; if T > 0 for pure metals, send also for the resistance 
due to impurities, it is possible to express the conductivity of a metal 
in the following way (Mott and Jones, loc. cit., p. 268 et seq.): 

The conductivity a is given by 


<7 = ne^rjinf 

where n is "the number of free electrons per unit volume and t, the time 
of relaxation, is given by ^ 


where N is the number of atoms per unit volume, v the Tnn-YiTmim 
velocity of the Fermi distribution, and A an effective collision area 
given by 

^ J {l---cos^)/(0)da). 


A 
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of ‘impurity centres’; for these the energy Q required to remove an 
electron into the conduction band is low enough, at the temperature 
considered, for a sufficient number of electrons to be free to give a 
significant contribution to the conductivity. 

According to the theory originally given by A. H. Wilson,t if -AT is 
the number of impurity centres per unit volume, the number of free 
electrons (in the conduction band) is, per unit volume, about 

Various types of impurity centres are believed to exist, as follows: 

(а) In materials such as zinc oxide a small excess (say, one part in 
10*) of zinc can exist. The zinc ion can be lodged interstitially in the 
lattice. Round such an ion the force on an electron in the conduction 
band is where k is the dielectric constant. It can thus be shown 
that there will exist, just below the level of the conduction band, a 
series of stationary states leading up to a series limit, just as for a free 
atom; but that, for the ^-states, at any rate for which the field near 
the centre does not matter much, the order of magnitude of the energies, 
measured from the bottom of the conduction band as zero, is 

— (18) 

or smaller by a factor than for a free atom. Also the radial extent 
of the wave function is of order 

nW/c/me^. 

These results will apply, as regards order of magnitude, for the ground 
state (w = 1), provided that the free atom is too large to go into the 
interstitial space. Since k^^IO for most semi-conductors, it will be 
seen: 

(i) that the energies required to release an electron may well be as 
small as 0*1 eV.; 

(ii) that the radial extent of the dissolved atom may be of the order 
10-30 A, and interaction between the centres may be expected 
at quite low concentrations. This may account for the known 
fact that Q often depends on concentration. 

(б) Excess metal may also be taken up in an ionic lattice through 
the presence of points in the lattice from which a negative ion is missing. 
The field round such a point will be again of the form an electron 
can be trapped there, and the same arguments about small energy and 

t A. H. Wilson, ibid. A, 133, 468 (1931); Theory o/ Jtfstois, p. 66 (Cambridge, 1936); 
Sem^owktctore and MekUe (Ocmbridge, 1939). 

607ft Q 
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large radius will apply. The ‘^-centres’ or colour centres which can 
be introduced into alkali halides by heating in metal vapour are of 
this type. 

(c) If an oxide contains excess oxygen, this will be taken up through 
the presence of lattice points from which a metal ion is missing , the oxygen 
ion being too big for an interstitial position. To keep the substance 
neutral, an equal number of positive holes must be present, which will 
be trapped in the neighbourhood of these points. As the temperature 
is raised, some will become free, and conduction becomes possible 
through the motion of positive holes. This is the mechanism responsible 
for conduction in CuaO with excess oxygen. 

(d) In 8e, Si, SiC the mechanism is not understood in detail. One 
can, however, say that the presence of atoms with an electronic struc- 
ture which is not a closed shell in numbers greater than that demanded 
by the crystal structure should give electronic (excess) conduction, and 
the presence of vacant lattice points should give defect conduction 
(conduction by positive holes). The higher the dielectric constant, the 
lower will be Q, other things being equal. 

41.2. Optical properties of impure crystals 

Any dissolved atom (i.e. electron trapped in the field of an interstitial 
ion or place where a negative ion is missing) should have in principle 
a line absorption spectrum leading up to a series limit. This is because 
the electron moves in a Coulomb field superimposed on the periodic 
field of the lattice. In practice these absorption lines will be greatly 
broadened by the vibration of the crystal. 

The well-known J-band which accounts for the colour of alkali 
halides with excess metal is probably an s-p transition of the trapped 
electron. 

Absorption of light in the band due to an impurity, therefore, will 
not lead directly to photo-conduction; nevertheless photo-conductivity 
k often observed, for instance, in alkali and silver halides. This is 
probably because the energy interval between the excited state of the 
centre and the bottom of the conduction band isw small that heat 
motion enajbles the electron to escape before it faUs back to the ground 
state, except at very low temperatures. 

41.3. Photo-conductivity 

One must distinguish between photo-conductivity of a semi-conduc- 
tor, where the absorption of light merely increases the conductivity 
already present, and the primary photo-current in insulators (alkali 
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halides with i’-centres, silver halides at low temperatures, etc.). The 
latter has been investigated by R. W. Pohl and his school.t In this 
work a crystal is placed between electrodes, given a short illumination, 
and the quantity nex measured, where n is the number of electrons 
released (usually equal to the number of quanta absorbed) and a? the 
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Fig. 69. Path of an electron released by light in 
an alkali halide crystal. 

distance drifted by each electron before it is ‘trapped*. This trapping 
is a marked feature of insulating crystals; an electron in the conduction 
band survives only a certain time before being trapped. The energy 
necessary to release an electron from a trap is, of course, less than from 
one of the original impurity centres, a;, the distance drifted, will be 
proportional to the field F and inversely to the number of traps. 

Other examples of trapping are: 

(a) In zinc sulphide phosphors, where illumination at low tempera- 
tures gives rise to trapped electrons which return to their original 
centres with emission of radiation only after warming up. 

(b) The action of light on photographic emulsion, where the first 
process appears to be the release by light of an electron which 
is trapped at a point where the latent image will be formed. 

41.4. Optical properties of pure crystals; the exciton 

In this section we discuss the optical absorption of a pure non- 
metallic crystal, that is to say, the absorption spectrum due to the atom 
or ions that form the crystal lattice. It should be emphasized that any 
allowed transitions of this type will lead to a very strong absorption 
coefficient, of the order 10* cm.-^ 

t R. W. Pohl, Proc. Phya. Soc. 49 (extra part), 3 (1937); F. Seitz, Rev. Mod. Phya. 
18, 384 (1946); N. F. Mott and R. W. Gurney, Electronic Proceaaea in Ionic CryaUUst 
chap, iv (Oxford, 1940). 
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In Fig. 60, if a quantum of frequency hv^ is absorbed, a free electron 
and a positive hole are produced; both can move in the lattice inde- 
pendently, giving (in theory) photo-conduction. The frequency hv^ is 
not, however, the lowest that can be absorbed, because it is possible 
for the electron and positive hole to exist in a series of states in which 
they are bound in one another’s field. If r is the distance between them 




(ft) 



Blocking direction 


(c) 



Direction of 
large current 


{d) 


Fio. 60. Potential energy of electron at metal/non-metal interface: (a) insulator; (6) 
semi-conductor in equilibrium: (c) field in blocking direction; (d) field in opposite 
direction. 


and R the coordinate of the centre of gravity, we shall expect these 
bound states to be of the form 

where 0„(r) are wave functions similar to those for a hydrogen atom. 
An electron and positive hole bound together in this way has been 
termed by Frenkelf an exciton; the formulation given here is due to 
Wannier.} 

If an exciton is formed by the absorption of a quantum of light, it 
will be formed in a state for which k = 0; therefore, owing to the 
Coulomb ,^eld between a positive hole and an electron, we expect the 
absorption spectrum to consist of a series of lines leading up to a series 
limit. In practice tte lines will be broadened by heat motion. 

Attempts have been made to account for the absorption spectrum 
of alkali halides along these lines. 

t J. Frenkel, Fhys. Bev. 37, 17 (1931). 
t G. H. Wannier, ibid. 52, 191 (1937). 
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An alternative description of the exciton may be given, which corre- 
sponds more closely to that of Frenkel’s original paper. In this we 
consider a crystalline array of atoms, numbered 1, 2,..., r,..., n. Let 
denote the wave function of an electron in the ground state of 
the atom r, and the corresponding wave function for some given 
excited state. Let be the wave function describing the 

whole system of n atoms when all the atoms are in the ground state 
except atom r; this can be formed by setting up an antisymmetrical 
determinant from functions of the type 

Then it can be shown that is not a Mionary state of the system, 
but that the stationary states are of the form 

r 

where R,. is the position of the rth atom. A wave function of this type 
represents the place where the atom is excited, moving about the lattice 
with wave-number k. In other words, an excited atom hands over its 
energy to a neighbour, the process is repeated, and so on. The place 
where the excitation is at any moment is called the exciton. 

41.5. C<mt(wt between a metal and an insulator 

If a metal is in contact with an insulator, we may introduce a 'work 
function’ which represents the work necessary to take an electron 
from the metal into the conduction band of the insulator (Fig. 60). 
(I>i will not, of course, be equal to the work function ^ which is defined 
as the work necessary to remove an electron from the metal to vacuum 
across a clean surface. Experimental evidence indicates that usually 
(l>i is less than From the theoretical point of view the image force 
is instead of so the contribution from that term to the 

work function will be very much smaller. 

Given two metals of work functions against a vacuum and 
<l>i against a given insulator, it is tempting to assume that 

but it is doubtful if this is a good approximation, and arguments can 
be given to suggest that has the same sign as but is 

considerably smaller. The field at the boundary of a metal will be 
changed if another solid is brought into contact with it. 

It will be assumed that is always positive. 

A non-metal with no electrons normally in the conduction band will 
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will be formed in a state for which k = 0; therefore, owing to the 
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absorption spectrum to consist of a series of lines leading up to a series 
limit. In practice tte lines will be broadened by heat motion. 
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t J. Frenkel, Fhys. Bev. 37, 17 (1931). 
t G. H. Wannier, ibid. 52, 191 (1937). 
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go over the barrier with the help of heat motion (no tunnel effect); 
a potential difference equal to will remove the barrier preventing 
the motion of electrons from semi-conductor to metal, but does not 
affect the height of the barrier opposing motion in the opposite 
direction. 

The theory of the effect is as follows: 

Figs. 60 (c) and [d) show that, if a potential V^je is placed across the 
barrier, V{x) is still given by (19) with 

The equation governing the current j crossing unit area of the barrier 

penmittimeis 


Here n(x) is the density of electrons at a distance x from the metal, 
V and D are the mobility and diffusion coefficient of an electron. Since 

Djv = kTje, 

this may be written 


dV . dn 

This can be integrated by multiplying both sides by ; we obtain 


j J enxmdx = vkT[n(0)-n(x)y^^»'‘’. 

0 

We now take a value (a^i) of x beyond the barrier; we find 


a*! 

j = I = vkT{n^-ni)e-<^'‘+<l‘'>l'‘^, 

. 0 

where is the density of electrons far from the barrier. 
We note: 

(а) riiV = Gy the conductivity of the semi-conductor. 

(б) Substituting from (19) for F(«), we find 

00 

J ~ J dx 

0 0 

(c) At points where Po = i == Hence 
Hence, finally, 


j = 2<7V{27ri\re®(Fo-f 
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t J. Frenkel, Fhys. Bev. 37, 17 (1931). 
t G. H. Wannier, ibid. 52, 191 (1937). 
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COLLISION PROBLEMS 


The problems to be considered in this chapter are of the following typeif 
a beam of particles, for example, electrons, is incident on an atom or 
atomic nucleus or other scattering centre; it is required to find the 

number of particles that are scattered out ^ 

of the beam in a definite direction. For this ► 

purpose we define the scattering cross-section ^ 

1(6) as follows: if the beam is such that v dat^^ 

particles cross unit area per unit time, then 

the number of particles scattered per unit \ 

time through an angle 6 into a solid angle ^eattering 

do) is vl{B)doj. It will be noticed that 1(6) of particles through an angle 0 

has the dimensions of an area; the scattering ^ 

is the same as it would be if the particles had to hit an area 1(6) du) 

in order to be scattered into the solid angle do). 


It 

The quantity j 1(6) du) = J I (6)27t Bin 6 d6 


is called the total cross-section for scattering. 

It will easily be seen that if the particles are moving through a 
medium containing N scattering centres per unit volume, the proba- 
bility per unit length of path that a particle will be scattered into the 


solid angle dw is 


NI(6)da). 


In the case of electrons incident on atoms, if their energy is greater 
than the first excitation potential, they may be scattered either 
elastically or after losing energy. In such a case we introduce the 
quantity 1^(6) to refer to elastic scattering and 4(&) to refer to scatter- 
ing after excitation of the atom to the state n; the quantity NIj^(6)d<x) 
thus gives the number of particles scattered per unit time into the solid 
angle dw after raising the atom to the nth excited state. 


t General references for the subject-matter of this chapter are : N. F. Mott and H. S. W. 
Massey, The Tfieory of Atomic CoUieiona (Oxford, 1033); J. H. McMillen, * Elastic 
Electron Scattering in Oases’, Rev, Mod. Phys, lly 84 (1939) — a review both of theory 
and e 3 q)erim 6 nt on the subject ; Ha/ndbuch der Phyaikf 24/1 (Berlin, 1933), article by 
H. Bethe, eiq)eoially for discussion of stopping power.. 
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43. Scattering by a centre of force 

The simplest collision problem is that of a stream of particles incident 
on a centre of force; by this is meant a region of space in which the 
potential energy of a particle is not zero but some function V{r) which 
tends to zero as r tends to infinity. If, for example, V{r) is equal to 
ZZ'e^lr, then the problem is that of a beam of particles each with 
charge Z'e incident on an atomic nucleus with charge Ze. To a good 
approximation the scattering of a beam of fast electrons by an atom 
can be treated by taking for the field of the atom a screened Coulomb 
field, such as that for which 

Ze^ 

V{r) = f:iexp(-^r). 
r 


The Schrodinger equation for a particle in such a field is 

VV+^{»F-F(r)}^=0, (1) 


where W is positive, being equal to the kinetic energy of an electron 
at a point remote from the atom. Writing 

= 2mWIP^ U{r) — 2mV(r)lh^, 

this takes the form V^+{B—U(r)}ifi = 0. (1.1) 

To describe the scattering it is necessary to find a solution that will 
have the form of an incident wave and a scattered wave. Thus a solu- 
tion must be sought which, for points distant from the scattering centre, 
has the asymptotic form 

0 ^ terms of order Ijr^. (2) 

If this is multiplied by the appropriate time factor it is seen 

that the first term represents a plane wave advancing from left to right 
along the 2 -axis. This wave describes a beam in which there is one 
particle per unit volume so that the number of particles crossing unit 
area per unit time is v (= hkim). The second term represents an out- 
going wave; in this \f(S)\^lr^ is equal to the number of particles per 
unit volume at a distance r from the origin, and thus the number per 
unit time incident on an area A is 


v\m\Ulr^. 


It follows that the number scattered into the solid angle dcu is v\f(d) \^do) 
and hence that /(gj ^ ( 3 ) 


The Scattering is thus completely determined if/(^) is known. 
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43.1. Mxmt solution of the wave equation representing an incident and 
a scattered wave 

In this section we shall show how a solution of equation (1) of type 
(2) can be obtained. We require first an expansion of the plane wave 



in a series of spherical harmonics. The required expansion is, 
writing s = r cos Q, 


^ikrcoBd ^ 2 (2l+l)i^Pi(0OBd)fi(r), 




where 




w 

(4.1) 


and is the Bessel function of order For the first of these 
Mr) = (kr)-hmkr. 

The asymptotic form of fi{r) is, for large r, 

f(r) ^ {kr)-hm(kr—^l7T). (5) 


Plots of/o(r) and/io(f) are given in Fig. 62 for ^ = 1. 

A proof of these propositions is given in any text-book on Bessel 
functions, t A proof of enough of them for the purposes of this chapter 
may be obtained as follows: equation (4) may be regarded as an ex- 
pansion of the left-hand side in a series of Legendre functions 2^(cos0) 

t For example, G. N. Watson, Bessel Functions, 
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with the functions fi(r) unknown; then multiplying both sides of (4) 
by i^(cos d)sm 6 and integrating with respect to 0 from 0 to tt, we obtain, 
writing cos 6 = 

+1 

fi{r) = ^, j (6) 

-1 

This equation defines /^(r); it also enables an asymptotic expression to 
be obtained; integrating by parts, equation (6) gives 


Thus, since P^(l) = 1 and 1) = (—1)', we find 

fi(r) = (kr)-hm(kr-^l7T), (7) 

which is identical with equation (5). 

Turning now to the equation (1) for a particle in a central field, we 
see that the general solution of this equation with axial symmetry 
must be of the form ^ ^,i>(oo8 e)F,(r), (8) 

where the Ai are arbitrary constants and the function is the bounded 
solution of 


This may be written 


(rf)+|/k*-f7(r)- 




tF = 0 . 


(9) 


It* follows that if U{r) tends to zero sufficiently rapidly as r tends to 
infinity, then the bounded solution must have the form at infinity 

Fi (kr)--hm(kr----^h+r)i). ( 10 ) 


The phase r|^ can in general only be determined by numerical integra- 
tion of equation (9), though approximate methods of evaluating them 
are given on p. 238; the term is added so th^t 7}i shall vanish if 
V(r) is zero. 

The oonistants Ai in (8) must now be chosen so that it shall have 
the asymptotic form (2); in other words, they must be chosen so that 

2il,i?(cos^)P^(f)^e<^ (11) 

shall represent an outgoing wave only. Substituting from (4) for we 
obtain for (11) 


2 i?(ooBfl){4,J5(r)-(2J+l)»y,(r)}. 
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w 
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{d) Various approximate expressions may be found for the phases rji. 
In the first place let us write 

g,{r) = krfiir), 0,(r) = krF,(r). 

Thus, from equation (9), it follows that 


^ jj. . 


Integrating from zero to r, we find 

r 

0 

We assume that U(r) tends to zero faster than 1/r, so that the integral 
on the right converges. Then, inserting the asymptotic forms (7) and 
(10) for Gi and gr^, we see that 

00 

Bmrj, = - j J ^t(‘>')9i{r)U(r)dr. (13) 

0 

This formula is exact. An approximation valid when I is large is to 
replace 0^ by Qi so that 

00 

0 


0 


If this approximation is used for all ?, and sin rjf is put equal to t^/, we 
obtain for f (6) 

m=~j"^V(rydr, (14) 

0 

where K = 2isin|^. 


This follows from the expansionf 

^ = |(2l+l)J?(coBe){/,(r)}*. 

Af 

Equation (14) is known as the ‘Bom approximation* to the scattered 
amphtude; in practice it is seldom a sufficient approximation. Bom*s 
original derivation is given in the next section. 


t G. N. WatflKm, loo. oit., p. 363. 
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An alternative and improved approximate method of determining 
the phases has been given by Pais,t as follows. A suitable solution of 
equation (9) when there is no interaction is 
Sfi(r) = {knhr)^^(kr). 

Pais takes the effect of the interaction into account by using a solution 
of the type G,(r) = 

where A is to be determined from equation (9) by a variation method. 
He is then able to show that 

0 

If this equation is solved graphically or by a method of successive 
approximations, then the phase is given by 

ni = — JttA. 

Hulthent has used a variational method to obtain if 

0 

and 0 is a function containing undetermined parameters c, then opti- 
mum values will be given by the condition dLjdc = 0. Hulthen takes 

^ \jf(r) = cos ri sin (1 +^e~^)sin t] cos kr^ 

with h as an adjustable parameter, in finding the phase tjq with a field 
of the type F(r) = ^br-h'^ used in Meson theory. 

Other methods of obtaining phase shifts will be described in the 
second edition of Mott and Massey, op. cit. (to appear shortly), 
Massey and Buckingham§ have worked out the scattering to be 
expected from a large number of fields which occur in nuclear theory. 

44. Born’s approximation for the scattering of particles by a 
centre of force: alternative treatment 

For the scattering of particles by a centre of force, it is possible, as 
shown in the last section, to obtain an exact solution of the wave 
equation. An important approximate method was first introduced by 
Bom,l| in which the field F(r) is treated as a perturbation. This is 
known in the literature as the Born approximation. 

t A. Pais, Proc. Ccvmb, Phil Soc. 42, 45 (1946). 
j L. Hulthen, Fysiogr. SaUsk. Lund, Forhandl 14, 21 (1944). 

§ H. S. W. Massey and R. A. Buckingham, Proc, Boy, Soc, A, 163, 281 (1937). 

II M. Bom, SSeita, f, Phyeik^ 37, 863 (1926) and 38, 803 (1926). ^e former paper is the 
hrst in which the probability interpretation of 0 was introduced, and is the first dis- 
cussion of a collision problem in wave mechanics. 
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We make use of the fact that a solution of the inhomogeneous 
equation = ■?’(*. S/- *). 

where F(Xyy,z) is a known function, is 

For a proof see, for example, Mott and Massey, op. cit.. Chap. VI, 
or any text-book on electromagnetic theory. 

If we write the wave equation in the form (1.1), namely 
V20+{P_l7(r)}^ = 0, 
and treat U{r) as small, then we may set 

and neglect the product E70i. Substituting for iff we obtain therefore 
VVi+i^Vi = U(r)ei^^, 

of which a solution is 

1 /* gifclr-r'l 

It is easily seen that this solution behaves at infinity like an outgoing 
wave; the asymptotic form is in fact 

pikr 

01 ~ 


where 




4^ 


J ^^mnr)+ikzU^r) dr 


with n = r/r. 

A convenient way of writing this is 


= (16) 

where ko, k are vectors of magnitude k in the directions of motion of 
the particle before and after scattering. 

It will be noticed that the same formula can he obtained by the 
methods of Chapter X for the probability per unit time that a particle 
makes a transition from the initial state with momentum ko and wave 
function to a final state with momentum k. 

Formula (16) may be simplified (when F(r) is spherically sym- 
metrical) by taking polar coordinates with axes in the direction of the 
vector ko— k; we thus obtain 


f{0) = ^ JJ dfl'r**-, , 
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where K = = 2A;8in On integration with respect to 6' this 

gives 

0 

as in formula (14). 

Ze^ 

For example, if F(r) = _exp(-gr) (screened Coulomb field), this 

gives ^2 1 

m = -f!i_cosecH^ - 

^ 2mv^ ^ l+{ql2kBmief 
If qjk is small, the scattered intensity tends to 

which, as seen in the next section, is the classical expression for the 
scattered intensity. 


44.1. 8mttering by a Coulomb field 

If a beam of charged particles, each carrying a charge Z'e, and such 
that one particle crosses unit area per unit time, falls on a single nucleus 
of infinite mass and charge Ze, then according to Newtonian mechanics 
the number of particles I($)dio scattered per unit time through an 
angle 6 into a solid angle do) is given by 


1 ( 6 ): 


f ZZ'ey 

\2mv^) 


cosec^ \6, 


The proof of this, the Rutherford scattering formula, is given in any 
text-book on nuclear physics. 

The same formula is exactly valid in (non-relativistic) wave me- 
chanics, provided 

(a) the Coulomb law of interaction, F(r) = ZZ'e^lr^ is assumed to 
hold down to all distances; 

(b) the velocity of the particles is small compared with c, the velocity 
of light. 

Classical results are also valid for the collision between two charged 
particles of finite mass, except when both particles are identical (two 
a-particles or two electrons, cf. Chapter V). 

For details of the proof of this the reader is referred to the original 
papers or to Mott and Massey, op. cit.. Chap. III. One has to obtain 
a solution of the equation 


«07S 



2me^ZZ' 11 
r 


0 = 0 


B 
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representing an incident and scattered wave. The solution may be 
shown to be ^ ^ F(-ia; 1; ik(r-z)), 

where F is the hypergeometric function defined by 

^ / ^6.1 ^6(6+l)1.2 ^ 

and OL == ZZ'e^jhv, 

The asymptotic form is ifj = I+Sf(d), 

where I represents an incident wave and is given by 

J _ ^ikz+ioilogkir-z) 


and Sy the scattered wave, is given by 

^ _ ^-l^ikr-ioLlogkr 

and f{B) = cosec^ ^6 e»aiofi:(i-co80)+i7r+2i>^ 

where = r(l+ia)/r(l-*ia). 

It will be seen that neither the incident nor the scattered waves have 
the normal form; this is due to the slow convergence of the Coulomb 
field. 

On forming the scattered intensity 1(6) = |/(^)|^ we see that the 
classical formula of Rutherford is obtained. Planck’s constant only 
arises in the phase of /, which is not observable (except in the collision 
between identical particles; cf. Chapter V). 


44.2. Collisions between electrons and atoms 

Up to this point we have considered only the scattering of electrons 
by a centre of force. We have now to outline a theory of the scattering 
of a beam of electrons by an atom, in which the interaction between 
an incident electron and the electrons in the atom is taken into account. 
In the subsequent analysis we shall discuss the scattering of a beam 
of eleckons by a hydrogen atom; the extension to a many-electron 
atom is straightforward. 

We denote the Schrodinger equation for the hydrogen atom by 
{H(p,r)^W}ils(r) 0, ^ 

where H ^ the usual Hamiltonian operator and r the coordinate of 
the electron. Let be the characteristic energies and correspond- 
ing wave function. Then the Schrodinger equation for an additional 
electron interacting with the atom is 

{-£v?+ir(p„r,)+F(r„r*)-lf)T(r„r,) == 0. 


(16) 
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where K = = 2A;8in On integration with respect to 6' this 

gives 

0 

as in formula (14). 

Ze^ 

For example, if F(r) = _exp(-gr) (screened Coulomb field), this 
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If qjk is small, the scattered intensity tends to 


which, as seen in the next section, is the classical expression for the 
scattered intensity. 
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If a beam of charged particles, each carrying a charge Z'e, and such 
that one particle crosses unit area per unit time, falls on a single nucleus 
of infinite mass and charge Ze, then according to Newtonian mechanics 
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angle 6 into a solid angle do) is given by 

The proof of this, the Rutherford scattering formula, is given in any 
text-book on nuclear physics. 

The same formula is exactly valid in (non-relativistic) wave me- 
chanics, provided 

[a) the Coulomb law of interaction, V(r) = ZZ'e^lr^ is assumed to 
hold down to all distances; 

(b) the velocity of the particles is small compared with c, the velocity 
of light. 

Classical results are also valid for the collision between two charged 
particles of finite mass, except when both particles are identical (two 
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For details of the proof of this the reader is referred to the original 
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representing an incident and scattered wave. The solution may be 
shown to be ^ ^ F(-ia; 1; ik(r-z)), 

where F is the hypergeometric function defined by 

^ / ^6.1 ^6(6+l)1.2 ^ 

and OL == ZZ'e^jhv, 

The asymptotic form is ifj = I+Sf(d), 

where I represents an incident wave and is given by 

J _ ^ikz+ioilogkir-z) 


and Sy the scattered wave, is given by 

^ _ ^-l^ikr-ioLlogkr 

and f{B) = cosec^ ^6 e»aiofi:(i-co80)+i7r+2i>^ 

where = r(l+ia)/r(l-*ia). 

It will be seen that neither the incident nor the scattered waves have 
the normal form; this is due to the slow convergence of the Coulomb 
field. 

On forming the scattered intensity 1(6) = |/(^)|^ we see that the 
classical formula of Rutherford is obtained. Planck’s constant only 
arises in the phase of /, which is not observable (except in the collision 
between identical particles; cf. Chapter V). 


44.2. Collisions between electrons and atoms 

Up to this point we have considered only the scattering of electrons 
by a centre of force. We have now to outline a theory of the scattering 
of a beam of electrons by an atom, in which the interaction between 
an incident electron and the electrons in the atom is taken into account. 
In the subsequent analysis we shall discuss the scattering of a beam 
of eleckons by a hydrogen atom; the extension to a many-electron 
atom is straightforward. 

We denote the Schrodinger equation for the hydrogen atom by 
{H(p,r)^W}ils(r) 0, ^ 

where H ^ the usual Hamiltonian operator and r the coordinate of 
the electron. Let be the characteristic energies and correspond- 
ing wave function. Then the Schrodinger equation for an additional 
electron interacting with the atom is 

{-£v?+ir(p„r,)+F(r„r*)-lf)T(r„r,) == 0. 


(16) 
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rearrangement collisions, it will be seen that the field P^(r) responsible 
for elastic scattering is just the electrostatic field due to the charged 
nucleus and a uniform negative charge distribution —e\iljQ(r)\^. 

For hydrogen this may be evaluated to give 

»o/ 

where = h^/me^ = 0*54 x 10“® cm. For other atoms the best available 
approximation is to use the potential given by the method of the self- 
consistent field (Hartree field).! 

For atomic hydrogen an analytic expression for the elastic scattering 
may be obtained; this is 

im = 4(8+a§ji:^)^ 

al{4+alKY 

where iT = Ik^— k^l = 2Ajsin 

It will easily be seen that an improved approximation to the elastic 
scattering is obtained by finding the exact solution of 

vv+(*^-^Kk.w)i^ = o 

instead of using Bom’s approximation; this amounts to inserting 
Fo(ri)^o(r 2 ) instead of in the right-hand side of (17). This 

method is the one which has actually been most used in calculating 
electron scattering in gases. 

For references to the many papers on the subject up to 1938, see the 
second general reference at the beginning of this chapter. Among more 
recent work Masseyf and his co-workers have estimated a self-consistent 
field for methane by averaging the proton distribution over all direc- 
tions, and have used this field to calculate the scattering of slow 
electrons; the results are in quite good agreement with experiment. 

' 44.3. Relative intensitm of elastically and inelastically scattered 

particles 

The following considerations show that it is impossible to obtain 
inelastic scattering from an atom (or nucleus) unless there is some 
elastic scattering present too. We shall confine ourselves to the case 
where the dimensions of the nucleus are small compared with the wave- 
length, so that the scattering is spherically symmetrical. 

t Cf. Chapters V, VI ; an analytic expression for the field of helium is given by E. A. 
Hylleraas, ZeUs.f. Physik, 54, 347 (1929). 

X R. A. Bucki^ham, H. S. W. Massey, and S. R. Tibbs, Proc, Boy. Soc. A, 178, 119 
(1941). 
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A spherically symmetrical solution of the wave equation of the 
system must then have the asymptotic form 

T(r,g) = r-^[e-^^tlfQ(q)+Ae^^riljo(q)+Be^^^^^ (19) 

Here ^e(^) represent the wave functions of the atom (nucleus) 
in its normal and excited states, k, k* the wave numbers of the elastically 
and inelastically scattered particles. The wave function representing 
an incident plane wave and scattered wave is 

where a, chosen to make terms in vanish, must be — l/2iA;. Thus 
the elastically scattered intensity is 


h ■= 3p(H-l)- 


and the inelastically scattered intensity 


— - 151 *. 


Since, however, the conservation of charge applied to equation (19) 


gives 


1 = 


we see that cannot vanish unless A = — 1 and 4 vanishes too; and 
if /g has the largest possible value (A = 0,/g = 1/4A;^), then 

k = 1/4^^ 

Such a formula would, for instance, apply to the scattering of slow 
neutrons by a nucleus of very strong absorbing power. 



INTERACTION OF RADIATION WITH MATTER 

45. Introduction 

In this section we shall show how to solve problems of the following 
type. An atom, or molecule, or any other system, is initially known 
to be in a given stationary state, denoted by the suffix zero, so the 
energy of the state is Wq. It is then acted on by a variable external 
field for a certain time, say by the field of a passing a-particle or gas 
molecule, or by a light wave. At the end of that time it may have 
made a transition to any other state, with energy We require to 
calculate the probability that the transition has occurred. 

The physical principles by means of which the calculation is made 
are as follows: The wave equation for the unperturbed system (atom 
or molecule) is 

where is the Hamiltonian operator of the unperturbed system. The 
solutions of this equation are 

and the system is supposed initially, at time ^ = 0, to be in the state 
with wave-function %. In the presence of the perturbing field let the 
wave equation be 

= ( 1 ) 


The quantity U is the addition to the Hamiltonian due to the perturbing 
field, and will be a function of q, djdq, and of the time. For instance, 
if we are dealing with the perturbation of a hydrogen atom by the field 
of a passing a-particle, U is the potential energy of an electron in the 
field of the a-particle. 

The equation (1), being of the first order in <, enables us to determine 
Y at any subsequent time L Suppose at time t we expand 'F(g, t) as a 

= (2) 


Then, by the usual interpretation of wave mechanics, the quantity 


given by 


Pn = 


is the probability that at time t the system is in the state n. 



248 INTERACTION OF RADIATION WITH MATTER § 46 

To obtain the coefficients a„(<), we substitute (2) into (1); we obtain 

n 

Multiplying both sides by and integrating over all g, this 

equation reduces to 

in^ = jY*(q,t)UY(q,t)dq. (3) 

To obtain ah explicit solution we must in general assume that the 
perturbation is small, i.e. that during the perturbation the change in 
T(g, t) is not large. With this assumption we replace T‘(g, t) in the right- 
hand side of (3) by its original form we thus obtain 

in^==jy¥t(q,tm{q,t)dq. 

Writing J ’F*(g)£7T„(g) dq = (n|C/|0), 

t 

we see that a^{t) = ^ J (4) 

0 

We note that in this expression (?i|C/|0) is in general a function of L 

Certain generalizations can be made at once from the expression (4): 

(а) If the perturbing field varies slowly with time, so that many 
oscillations of the exponential function in (4) occur while U 
remains sensibly constant, the transition probability will be small. 

(б) If f7 is periodic in the time with period v (as, for instance, if f7 is 

the perturbing energy due to a light wave), then will not 
increase with the time unless hv = We see therefore that 

for the excitation of atoms or molecules by light the Bohr fre- 
quency condition can be deduced from the theory, and that if 
V is independent of the time, transitions to states of equal energy 
only occur. 

45.1. Excitation of atoms by a 'passing a-particl^ 

In this section we shall show how to calculate the probability that 
an atom fs excited by an a-particle which passes a long way from it. 
Actually the first attemptf to calculate the stopping power of matter 
for a-particles was Inade in this way; but the method used in Chapter 
IX, in which the motion of the a-particle as well as that of the electrons 
is treated through a wave function, is much more convenient. The 
t J. A. Gaunt, Proc, Comb, Pha, 8oo, 23, 732 (1927). 
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calculation is included here as being the simplest and most instructive 
example of the calculation of transition probabilities, not for its practical 
value. 

The a-particle is supposed to move with constant velocity v past a 
hydrogen atom at the origin of coordinates, passing it at a distance p. 
If we take ^ = 0 to be the moment when the a-particle is nearest to 
the atom, then the perturbing energy U at any other time is, for an 
electron at the point (x, y, z) 

—Ze^ 

^ ^ 4{X-x)^+pr-yn{Z-zn 

Here Ze is the charge on the a-particle, and X, 7, Z are the coordinates 
of its position at time t, given by 

X = Vty 

r = p, 
z = o. 

Expanding U for small values of a;, z we have 
TT ■■ xX-\-yT \ 

iJ L J‘ 


Since we have assumed that p is large compared with the radius of the 
atom, only the first terms in the expansion need be taken. 

The quantity ajif) is thus given, after the a-particle has passed 


(^-> 00 ), by 





— 00 


For the evaluation of these integrals, the reader is referred to the paper 
by Gaunt already quoted. 


45.2. Transitions to unqmntized states 

A perturbing field (e.g. the field of a passing charged particle, or a 
light wave) may ionize an atom or molecule; in other words, transitions 
may be caused to states with imquantized energy. To calculate the 
appropriate transition probabilities, the simplest procedure will be to 
introduce fictitious quantization by enclosing the whole system in a 
large box, of side L, say. For instance, suppose the final state of the 
particle is represented by the wave function of a free particle 

T,(r) = e«^) 

Then if we introduce our box of side L, together with the boundary 
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condition that shall be periodic with period L, the normalized wave 
function is 

(k = 2irn/Ir; n = (%, Wj, Wj)). 

If we write (»|C/|0) = ^ J e~^’">f7’F(,(r) dr, 

then the transition probability to the state n is given by \a.n(t)\^ with 
a^it) given by (4). For values of k lying in the limits between and 
ki+dk^y k 2 and kz+dk^^ k^ and k^-{-dk^, the number of states is 

L^^kidk^dk^l{2'jTf. ( 6 ) 

. Thus if (ifcl tJ' 10) = J e-^V%(r) dr, 

the chance that the electron is ejected into a state with wave numbers 
in these limits id 


where a'{t) = 1 J (Z|{7'|0)e«»’*-W/A dt 

0 


dk-j^ dk^ dk^ 
"~(27r7“ 




45.3, Transitions due to a perturbation periodic in the time 
This case is of great importance because of its application to the 
action of light waves on atoms and molecules. 

We suppose that the perturbing term which has to be added to the 
Hamiltonian of the system is of the form 

U = aco8(2w<+€), 

where a, c are functions of p and q but do not contain the time. It is 
convenient to write this in the form 


where F is a complex function of p and q and F* is its complex con- 
jugate. Formula (4) then gives for the probability that at time t a 
transition has been made to the state n of energy 


■P„='K( 0 I*, 

where ' 



( 7 ) 
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Here (nl^lO) = J ^:{q)F(pMq)dq, 

which is not a function of time. 

Unless \Wn-W^\ = 27r^v, (8) 

the integrals in (7) do not increase with the time. The sudden im- 
position of the field will cause transitions, but after a time equal to 
Ijv the probability that a transition has taken place does not grow any 
greater. Only values of the frequency for which (8) is satisfied need 
therefore be considered. 


If we write = (WQ—WJI^irh 


and suppose that v lies in the neighbourhood of Vg,,, then only one of 
the two terms of (7) will be significant. Carrying out the integration 
with respect to t we find ' 


If V is equal to vqrj th© probability PJf) that a transition has occurred 
increases as the square of the time, instead of linearly with the time 
as one would expect. However, this apparent contradiction can be 
resolved as follows: 

If the final state is quantized, we shall have to consider perturbation 
by radiation with a band of frequencies extending over the natural 
width of the absorption line. This is done in §46 and it is shown that 
a transition probability proportional to t is obtained. In this section 
we suppose that the final state is unquantized — in other words, that 
the calculation refers to the ionization of an atom by a light wave or 
other periodic disturbance. The fictitious quantization of §46.2 may 
be introduced, the atom being enclosed in a ‘box’ of side L, Thep 

\(n\Fm^ J e-«>^F%(T)d^\ 

and the number of states for which the particle is moving with its 
momentum in the solid angle dQ about the vector k, and with energy in 
the range dWy is by (5) 

d^lm^h~^2W)^ dW, = p(W)dW, say. 

To obtain the probability that, after time t, the particle is ejected 
in a direction lying in the solid angle dQ, we have to sum over the 
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transition probabilities to all these states; this gives, making L tend to 
infinity so that the summation is replaced by an integral, 

dajp^mwM- 

Now the quantity P^{t) has a very strong maximum when passes 
through the value for which vanishes, and for large t practically 
all the value of the integral comes from here. Thus in all terms in 
p(WJPn(t) except the term involving the time factor, we may set 
equal to this value, i.e. 

= W^-\-2TThv, 

The time factor gives 

r 2{l-COB2ir(v->'o,J<} 

J {2ir(v-Vo„)}i* 

or with 27r(v— Vort) = 

r 2(1-00^^ 

J X* 

When t is large the limits of integration may be replaced by ±oo and 

2(l-cosy) , 

t 

The integral is equal to 2 it; thus, finally, the probability that a transition 
will have taken place after time i is 

mmF\omWnh (lo) 



which increases linearly with t as it should. 

The probability per unit time that the electron is ejected with its 
momentum in the solid angle dQ, is 


dQ 


mv 


J 




45.4. Trarmtions caused by a perturbing term independent of the time 
This is a special case of the problem considered iq the last section, 
the frequency v being taken to be zero. 

Formula (9) remains valid. Since U is now independent of i, it may 


Here U is the perturbing potential and the matrix element (w|17|0) is 
formed with normalized wave functions. When t is large the function 
on the right has a strong maximum for equal to zero, i.e. for transit 
tions in which energy is conserved. 
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If, as before, tve introduce the function p{W)dW to denote the number 
of states with energy between W and dW (and if necessary subject also 
to other conditions, such as having the direction of momentum in a 
given solid angle), then the probability after time t that a transition 
has occurred is 

f FJt)p(W)dW. 

As t tends to infinity, this gives as before 

tMU\omw)iri ( 11 ) 

for transitions in which energy is conserved. For other transitions it 
does not increase with t. 

The following is an example: A beam of electrons is incident on a 
centre of force in which the potential energy of an electron is F(r). 
If we normalize %(r) to represent a beam such that one electron crosses 
unit area per unit time, then 

To(r) = 

Equation (10) then gives for the number of particles scattered into the 
solid angle do) per unit time 

eMrv{r)dr'du>, 

which agrees with the formula found by another method in Chapter 
IX, equation (16). 

46. Emission and absorption of radiation 

The correct description of the intensities of spectral lines according 
to the quantum theory was first given by Einstein. f Corresponding 
to the transition between any two states n and m of an atomic system, 
he introduced three probability coefficients and The 

state n is supposed to be the state of highest energy, and the coefficient 
-4^^ is defined as follows: A^^^dt is the probability that an atom, initially 
in the state n, will make a spontaneous transition in the time interval 
dt to the state m. This probability was supposed to be independent 
of the past history of the atom or of the process by which it had been 
brought to the state n. 

The coefficients and B^^^ are defined as follows: suppose the 
atom is in the presence of radiation, unpolarized and incident equally 
in all directions, such that the energy density with frequency between 
V and v+dv is I{v)dv, Let be the frequency corresponding to the 
transition between the states n, m. Then if the atom is in the state m 
t A. Einstein, Phye, Zeito. 18, 121 (1917). 
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(the lower state), the chance in a time interval dt that it will make a 
transition to the state n with the absorption of a quantum of radiation 
is Also, if it is in the upper state n, the chance that in 

the presence of the radiation it will make a transition to the ground 

Einstein was able to prove the following relationships: 

(12) 

The proof is as follows: Suppose a number of the atoms concerned are 
in thermal equilibrium in an enclosed space at temperature T, together 
with the (black body) radiation in equilibrium at that temperature. 
Then as many atoms must make the transition upwards as make the 
transition downwards. Hence 


where are the numbers of atoms in the two states. Now in 

thermal equilibrium, by Boltzmann’s laws, 


^ _ ^-(Wn-Wm)ikT 

N 


which by the Bohr frequency condition gives 


N 

i!" = e-^^'nJkT 

Nm • 


Thus 


= 




Comparing this with the well-known formula for the intensity of black 
body radiation, 1 


we see that equations (12) are satisfied. 

In attempting a quantal theory of the emission and absorption of 
radiation we are faced with the following position: The theory of 
absorption and stimulated emission is relatively Elementary; the light 
wave cantbe treated as an electromagnetic field periodic in the time, 
and the probability that the atom will make a transition from one 
state to another under its influence can i)e calculated by the methods 
of the last section. As we have seen, the Bohr frequency condition can 
be deduced from the theory, and the values of the Einstein B coefiicients 
obtained. The A coefficient can then be deduced from equation (12). 
But this elementary theory does not by itsdf predict spontaneous 
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emission; in the absence of a field one cannot see why transitions should 
occur at all. To account for spontaneous transitions one must intro- 
duce a more complete theory in which atom and radiation are both 
considered as one system, and the radiation itself is quantized. 

The elementary theory will be given in the next section; the more 
complete theory in § 47 and the following sections. . 


46 . 1 . Elementary theory of absorption 

As explained in the last section, in this section we shall treat the 
action of a light wave on an atom as that of a perturbing electro- 
magnetic field. Such a theory will account for absorption, but not for 
emission. 

A plane polarized light wave moving along the z-axis can be written 
in the form 

6 sin 2 ttv ( 




iH 


(13) 

H^ = Ey = 0 . 

It is convenient to describe this by a vector potential A and scalar 
potential (^, from which E and H can be deduced from the equations 

laA 


E= ~- 


“grad^, 


/ 

If we set 


H = curlA. . 


Aj. — acos27rv|~— (a = 6c/27rv), 


Ay Aj. 


0, 


it is easily verified that (13) is satisfied. The energy density I in the 
wave is given by 


/ = 


E2+H2 _ TraV 
2c2 * 


Stt 


By equation (46) of Chapter I, the perturbing term introduced by 
a vector potential A for which div A = 5 : 0 is 


imc 


(A grad). 


In our case this gives 


^^L27ri»»(«/c-4)^_j_complex conjugate). 


2tmc( 


dx 


I 


Thus the probability that after time t the atom has made a transition 
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from a state with wave-function to a state with wave-function 
is given by (9) with 

\(n\F\0)\^^ 

Y*Q27Tivzlc^^0 

” dx 


where 


Qn- j 



Substituting in terms of the energy density we find for the probability 
Pn(t) 


e- 


TU.\n 2{l-cos2^7(y-vo„)0 


As in the previous section, if we set v equal to we find that 
increases as the square of the time. We have therefore to consider that 
the atom is irradiated by a band of frequencies, with energy density 
/(v)dv in the range dv. Since 


f 2{l-COB2?r(v-V(^)t} , ^ 


we see that the probability per unit time that the atom makes a 
transition from the state 0 to the state n is 


It will be noted that the 2 -axi 8 is the direction of propagation of the 
wave, the a;-axis the direction of polarization. 

The Einstein jB-coefficient will be obtained by averaging over all 
directions of incidence and polarization. 

The integral f dn- (16) 

J dx 

thus determines the absorption and emission intensities. This quantity 
can be simplified, making use of the fact that for light absorbed in the 
line spectrum of an atom, the wave-length cjv of the light is large com- 
pared with the diameter of the atom. We may thus expand 

To a first order of approximation, therefore, the integral (16) reduces to 

/ ■r.f . 

By equation (21) of Chapter VII this may be written 
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Thus the probability of absorption takes the form 

Averaging over all directions of polarization we obtain for the 


^-coefficient 


■® = ^(l*0nP+l3^0«l*+|20nl*). 


and by (11) for the A-coefficient 

It is convenient to express these quantities in terms of oscillator 
strengths/^ (Chap. VII, p. 169); we obtain 

= e^fjQhmv, A = STr^cV/JSmc^. (16) 

This form is convenient because is dimensionless, and because 

2/n=l, 

where the summation is over all transitions which start from a given 
state. 

46.2. Selection rules 

Suppose that the state 0 has orbital quantum numbers l^u; in other 
words, that the part of the wave function involving ^ is 

Pf(cos0)e^“^. 

Suppose also that the state n has quantum numbers l\ u\ Then the 
matrix elements will contain integrals of the type 

COS0 

Pjf'(cos d)e-^^'^ sin 6 cos ^ P5*(cos d)e'^^ sin 6 
sindsin^ 

It is obvious that this integral vanishes unless 

= ±1 or 0. (17) 

The integral also vanishes unless 

l-V = ±1. (18) 

This is shown in the appendix to this book; it is easily verified in the 
simple case where I = 0. 

Equations (16) and (16) give the well-known selection rules for optical 
transition. If they are not satisfied the ‘dipole moments’ etc., 
vanish, and to obtain the transition probability we must then take the 
second term in the expansion (14); the integral (15) becomes 


// 


2w r 
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Transitions for whicli this quantity does not vanish are said to possess 
a ‘quadripole’ moment. It will be seen that for transitions with a 
quadripole moment the transition probabilities are smaller by a factor 
of order (radius of atoms/wave-length of light)® than for those with a 
dipole moment. 

It is worth remarking that there is one type of transition for which 
the transition probability vanishes exactly; this is for transitions from 
one 5-state to another, so that both % and are spherically sym- 
metrical. 


46.3. Alternative treatment of emission 

For certain purposes, especially some of the applications of rela- 
tivistic quantum theory in Chapter XI, it is very useful to have an 
expression for the electromagnetic field radiated by an atom. The 
treatment given in this section is semi-classical, and can only be 
justified on the basis of quantum electrodynamics; it is none the less 
very useful. 

Suppose that in an atom there exists a charge density given by 

j p = po(».y.*)e'*^’"+c.c. 1 

and a current j . (19) 

Then the electromagnetic field radiated by this charge and current are 
given by the equations 






i? = _ 


imp, 


c* a* c ’ 


Here Af^y A are the scalar and vector potentials. If we consider retarded 
potentials only, obtaining thus a solution representing an outgoing 
wave, the solution is 

r p2iriv\r'^rllc 

(nth a similar expression for A. For large r this Incomes 

Ao = j (20) 

Here n denotes the vector r/r. 

Suppose now that an atom is in an excited state 0, and that we wish 
to know the field that will be radiated into transitions to lower states. 
We shall make the foUovdng new assumption: that corresponding to 
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any lower state n, the field radiated initially is that which would be 
given classically by a charge density and current density given 
by (19) with 

jo = |(’i;gradYJ-'r*gradT„), 

hv 

For the corresponding expressions using Dirac’s relativistic equation 
cf. p. 309. 

It may easily be verified, using Schrodinger’s equations, that the 
continuity equation is satisfied: 

divj + ^ = 0- 

. The field radiated is then given by (20). 

It may be verified, by obtaining the energy radiated per unit time 
and dividing by hv, that equation (16) for the ^4 -coefficient is obtained. 

47. Proper treatment of the radiation problem : quantization of 
the field 

We have seen that an elementary theory, in which the electro- 
magnetic field is treated as a perturbation acting on an atom, can 
account for the absorption but not for spontaneous emission. To 
account for this, it is necessary to quantize the electromagnetic field 
and to treat the whole assembly, atom and radiation, as one dynamical 
system. 

The theory, due originally to Dirac,t is based on the following idea: 
The radiation is thought of as shut up in a box; there will thus be a 
series of normal modes each with its discrete frequency v. We know 
from the experimental evidence reviewed in Chapter I that the energy 
of light of frequency v can change only by multiples of hv; and thus it 
involves no great step forward to treat each normal mode as a vibrating 
system, with wave functions appropriate to a simple harmonic oscillator. 
A transition in which an atom emits or absorbs radiation is thus treated 
as a transition of the whole system in which the appropriate normal 
mode of the radiation changes its quantum number by unity. The 
perturbing energy (not a function of the time) is the interaction between 
the atom and the radiation. 

We have first to set up the Hamiltonian of the radiation field in the 
t P. A. M. Diiao, Pfoe. Boy, Soe, A, 114, 248 (1927). 
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absence of any charged particle. This field is conveniently described 
by a vector potential A chosen so that the scalar potential vanishes 
*^everywhere. The electric and magnetic vectors are then given by 

E = —A/c, H = curl A. 


A typical standing wave, representing radiation shut up in a box, may 


be written 


A = j8esin(xr)cosfcc^. 


(21) 


Here e is a unit vector, j8 an arbitrary constant, and 1[k any length. 
In a volume F the number of normal modes with given polarization, 
frequencies between v and v+dv, and direction x of propagation in the 
solid angle dfl isf 

Vppdvd^ = Fv*df2dv/c®. (22) 

The function py == will be called the density function of the 
radiation. 

We now write instead of (21) 

A = -y/(87rc*)esin(xr)g(^). (23) 


The quantity q(t) is the dynamical variable which specifies the state 
of the vibrating normal mode. We shall treat it exactly as the dynamical 
variable of an oscillator according to quantum mechanics. For this we 
need the Hamiltonian and hence the wave equation of the radiation 
fi^ld. The purpose of the normalizing factor -^(Sttc^) will appear below. 

The electric and magnetic vectors E and H derived from the vector 
potential A (23) are 

E = -V(8;r)e8m(xr)g(«) \ . 

H = ^(8wc*)(xxe)oos(jcr)g'(<) I 


Now since x and e are perpendicular vectors and e is a unit vector 

(xxe)* = »c*. 


Making use of this result and of the fact that the mean values of 
Bin®(xr) and oos®(xr) are i, we find that the energy of this wave is 

TT = 1 J (E*+H»)rfr = 4(f +c»4»). (26) 

The momentum conjugate to the coordinate q in the Hamiltonian 
.formalism is 

so that, putting 

t Of. J. H. StOM, Dynamieal Theory ef Qam, 4th ed., ohap. xvi (Cambridge, 1926). 


dW . 
cV* = 4»rV, 


(26) 
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we have for the Hamiltonian of this vibration 

PT = ^(p2+47rVV). (27).; 

It follows that, with the normalizing factor as chosen, the vibrating 
normal mode behaves like a simple harmonic oscillator of mass unity. 
This is true either in classical or wave mechanics. 

In wave mechanics we may set up the usual Schrodinger equation 

= (28) 

The possible values of the energy will be 

K = (29) 

where n is an integer, and the wave functions are the Hermitian 
polynomials discussed in Chapter II and the Appendix. 

The pure radiation field may then be described by the superposition 
of fundamental vibrations of the type (23) with a range of frequencies 

The number of frequencies lying in any given range is determined by 
the formula (22), but the vectors e^, are considered to be distributed 
at random. The Hamiltonian of the radiation field is obtained by 
taking the sum of all the contributions Wg of the type (27); thus it isf 

w = = (30) 

8 8 

In the quantum theory of radiation it .is often more convenient to 
replace the cosine and sine waves representing the radiation field by 
their complex exponential forms, which represent waves moving in 
definite directions instead of standing waves. In this representation we 
write for the vector potential 

A = 2A. = 2{a.?.W+a???(0}. (31) 

8 8 

where the field coordinate qjif) is now complex and 

a, = »(**. ■*). (32) 

The Hamiltonian of the radiation field can be calculated as in the case 

of real wave functions; it comes out to be 

w = l (ftg?+4^M?.g?) = 1 (P.P?+4irM g.g?). (33) 

8 8 

t It can eaedly be shown that the sum on the right-hand side of (30) representing the 
sum of the energies of all the oscillators is equal to the energy of t^ radiation field. It 
should also be observed that only the radiation field can be represented by a super- 
position of stationary waves in this fashion; such a representation is not possible for 
radiation in the presence of an external electromagnetic field. 
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Now since is a solution of the harmonic equation 

it follows that • 

we may thus write the Hamiltonian (33) in the form 

= 8 ^ 2 •!?.??• 

$ 

This may be reduced to the form for the real variable 
by the substitutions 



47.1. Hamiltonian of the complete system 

We have now to determine the Hamiltonian of the atom and the 
radiation field considered as a single mechanical system. The non- 
relativistic motion of an electron in an electromagnetic field described 
by a vector potential A and a scalar potential is uniquely determined 
by the HamUtonian ^ _ e^/c+(P_eA/c)*/2m, 

where P denotes the momentum of the electron, m its mass. Writing 
V for ^/c, expanding the scalar product (P— eA/c)®, and retaining only 
terms of order 1/c, we have 

s = P2/2m+eF-e(P.A)/mc. (34) 

The Hamiltonian of the complete system comprising the atom and the 
radiation field is obtained by adding to (34) the Hamiltonian of the 
radiation field, equation (30) or (33) above, and by substituting for 
the vector potential A the corresponding expression (23) or (31 ). Substi- 
tuting from equations (31) and (33) into equation (34) we obtain for 
the Hamiltonian of the coupled system 

S = $o+Si> (36) 

where So = 2 (36) 

^m a 

is the suifi of the Hamiltonians of the pure radiation field and of a 
particle moving in a field free from radiation, and 

$ 

is the energy of interaction between the electron and the radiation 



§ 47 PEOPER TREATMENT OF THE RADIATION PROBLEM 263 

The transition probability for a transition in which the electron 
jumps from one state to another, emitting or absorbing a quantum, 
can now be written down at once from formula (11). Suppose that the 
direction of propagation of the light wave lies in a solid angle dQ about 
the direction x. Let this wave have N quanta before the process, N' 
after, and let the suffix 0 denote the initial state of the atom and the 
suffix n the final state. Then, substituting for U in the formula (11), 
the probability per unit time that the transition occurs is 

i|(iV',nis,|iyr,o)iv(Pr), (38) 

where and „ are solutions of the unperturbed wave equation 

(39) 

and in this case 

_?^{gei(«r)+g*e-««5}(e.P), (40) 

m 


47.2. Calculation of the matrix elements 

In the last section we showed that, in order to obtain the transition 
probability of a process involving the emission or absorption of radia- 
tion, we have to consider the solutions of the wave equation (39) of the 
unperturbed system. Suppose that is an eigenfunction and the 
corresponding energy value of the Schrodinger equation 


and that is an eigenfunction and a corresponding energy value 
of the wave equation 

Then the function — (41 ) 


is an eigenfunction of the Schrodinger equation (39) corresponding to 


the eigenvalue 


^N,n ' 




(42) 


Substituting from (40) and (41) into the equation defining the matrix 
element 0), we obtain the equation 


2V7re 


[(n|P,|0) J ^^?^jvd?+(n|P*|0) J (43) 


m 
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where the matrix elements (niii|0) and (wIPi'IO) are defined by the 
formulae 

WP*|0) = J'F;c-«“)(e.P)%dr 

Since the functions are the eigenfunctions of the wave equation 
for a linear oscillator, it follows that the integral 

vanishes unless |^— ^'| = 1 and also that 

J (46) 

The energy corresponding to the eigenfunction is 

W = (46) 

Substituting from equation (46) into equation (43) we see that the 
matrix element (^',ii|§i|-^,0) is different from zero only if^' = 

We thus obtain for the two non-vanishing matrix elements 

(JV-l.n|§,|jy,0) = -^(vf j T*e-«“)p.%dr 

where Pe = (Pe) 

denotes the component of the vector P in the direction of the polariza- 
tion. 

For a process in which a quantum is emitted, N is zero, and for the 
absorption of a quantum iV = 1. Thus the transition probability for 
a transition in which an electron jumps from one state (0) to another 
(n) with the emission of a quantum of frequency v is 

-^11 , ( 48 ) 

and the transition probability for a similar process in which a quantum 
is absorbed is 

;i&| 

If the final state of the electron belongs to the continuous spectrum 
PfydW == mcpdWdQlhh^, 
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where p is the momentum of the electron. In that case (38) becomes 


2npe^dQ( C 

hW U 


Y*e-^-^p,YochJ. 


(49a) 


48. First-order radiation processes 

48.1. The emission of radiation 

As a first example of the application of the general theory developed 
in the preceding sections we consider the emission of light by a hydrogen 
atom. Suppose that initially the electron is in an excited state in which 
its wave function is and that it falls to the ground state with the 
emission of a quantum of radiation of frequency v. The wave function 
of the electron in the ground state is Tq, say. Then the roles of the 
suffixes n and 0 are interchanged in equation (48). It follows from 
equation (8) that the transition probability is negligible unless the 
frequency of the emitted radiation is 


- = (Wr,~Wo)IK 

which is the value given by Bohr’s frequency relation. 

The square of the matrix element occurring in the expression (48) 
can be reduced easily to a familiar form. In general the wave-length 
of the radiation emitted is large in comparison with the dimensions of 
the atom so that the function expi(xr) {k = 2ttvIc) will be practically 
constant over that part of the space in which the wave functions of 
the electron differ appreciably from zero. We may therefore write 

J J 

If, now, we write ^ = mv 


and denote by 0 the angle between the velocity vector v and the 
direction of polarization, we obtain 

J ^ip,\dr = mcos0 J T*vT„(ir, 
so that |J = to*cos* 0 jj Yf 

Now J = 2mv J = 27ni'(0|x|») 

and similarly for the y- and z-oomponents of the velocity v. Thus 

I J = 47r*r*TO*cos*0|(O|rln)|» (60) 

where |(0|r|ii)|* denotes the sum 

(01x|»)*+(0|y|n)*+(01z|»)*. 


( 61 ) 
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Equation (48) then gives for the chance per unit time that a quantum 
is emitted into the solid angle dQ 

^^g^cos20i(Oir|w)12da (62) 


Multiplying this expression by hv and integrating over all angles we 
obtain an expression for the total intensity of the radiation emitted 
by the atom in unit time. The summation over the directions of 
polarization replaces oos®0 by sin*^, where 6 is the angle between the 
direction of propagation x and the position vector r of the electron; 
making this substitution in (62) and integrating over all possible values 
of the angles, we find that the total intensity radiated per unit time is 


647r*eV* 

3c» 


mr\n)\\ 


(63) 


This is the same formula as that given on p. 267. 

In the classical theory, if we take as a model of the light source a 
single electron bound elastically to a massive charge and moving with 
simple harmonic motion of frequency v in a straight line with amplitude 
r, we find that the energy emitted per unit time is 




3277^6 V -2 
3c« 


(64) 


We see therefore that the radiation is the same as that which would be 
given out by a classical dipole of amplitude r given by 

f» = 21(01r|n)|*. 


48.2. The photodectric effect 

We saw in the last section that when light is emitted by an atom 
its frequency v is given by the equation 

v = (ir^~w;)/A, (66) 

where are two energy levels of the atom. In a similar way it 

can be shown that light of frequency v can be a^bsorbed by an atom 
only if V satisfies a relation of the type (66). Another process is possible, 
however, for light of such a frequency that hv exceeds the ionization 
potential, J. It may then happen that the electron absorbs a quantum 
of radiatiqn of sufSicient energy to cause it to leave the atom with 
kinetic energy Av— /. We can easily calculate the transition probability 
for a process of this t 3 rpe, if we assume that the kinetic energy of the ' 
ejected electron is very much greater than the ionization potential I 
and if we employ noi^-relativistio wave mechanics throughout. This 
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last assumption is equivalent to assuming that the kinetic energy of 
the electron in the continuous spectrum is small in comparison with 
the rest-energy me® of the electron. 

We shall carry out the calculation for an electron in a (l8) state and 
thus for the ii -electron of an atom. 

In the integral occurring in the expression (49) for the transition 
probability, Tg is thus the wave function 

Tq = (a = a^jZ) 

of the electron in the Z-shell, and, if the energy of the ejected electron 
is very much greater than the ionization potential, is the wave 
function corresponding to an electron in the continuous spectrum with 
momentum p, i.e. the plane wave 

If we write q = p—Tilhc, 

then q is the momentum transferred to the atom and we have 


where we have written a — helot. Using the result 


we then obtain 


J 


e-yr+«p<-)(iT = 





e-««)'FodT = 5>, 



J 87ra(fc)® 

{of+qY 


( 66 ) 


Substituting this value of the integral into the expression (49 a) for the 
transition probability and dividing by the intensity of the primary 
beam per cm.® (i.e. by c for a single quantum), we obtain the quantity 


32e® 2 


.da, 


(67) 


which has the dimensions of an area and is called the differential cross- 
section, or the effective area that has to be hit by a quantum to eject 
an electron in the solid angle dQ., 

To obtain the total cross-section for the ejection of photo-electrons 
in any direction we integrate (67) over all angles. Since the ^-shell 
contains two electrons we multiply by 2 to obtain the cross-section (f>^ 
for the photo-effect of the £-shell; the result of the integration isf 



t W. loo. oit., pp. 122-8. 
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Equation (48) then gives for the chance per unit time that a quantum 
is emitted into the solid angle dQ 

^^g^cos20i(Oir|w)12da (62) 


Multiplying this expression by hv and integrating over all angles we 
obtain an expression for the total intensity of the radiation emitted 
by the atom in unit time. The summation over the directions of 
polarization replaces oos®0 by sin*^, where 6 is the angle between the 
direction of propagation x and the position vector r of the electron; 
making this substitution in (62) and integrating over all possible values 
of the angles, we find that the total intensity radiated per unit time is 


647r*eV* 

3c» 


mr\n)\\ 


(63) 


This is the same formula as that given on p. 267. 

In the classical theory, if we take as a model of the light source a 
single electron bound elastically to a massive charge and moving with 
simple harmonic motion of frequency v in a straight line with amplitude 
r, we find that the energy emitted per unit time is 




3277^6 V -2 
3c« 


(64) 


We see therefore that the radiation is the same as that which would be 
given out by a classical dipole of amplitude r given by 

f» = 21(01r|n)|*. 


48.2. The photodectric effect 

We saw in the last section that when light is emitted by an atom 
its frequency v is given by the equation 

v = (ir^~w;)/A, (66) 

where are two energy levels of the atom. In a similar way it 

can be shown that light of frequency v can be a^bsorbed by an atom 
only if V satisfies a relation of the type (66). Another process is possible, 
however, for light of such a frequency that hv exceeds the ionization 
potential, J. It may then happen that the electron absorbs a quantum 
of radiatiqn of sufSicient energy to cause it to leave the atom with 
kinetic energy Av— /. We can easily calculate the transition probability 
for a process of this t 3 rpe, if we assume that the kinetic energy of the ' 
ejected electron is very much greater than the ionization potential I 
and if we employ noi^-relativistio wave mechanics throughout. This 
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The solution of equation (61) can readily be obtained if the solutions 
and the eigenvalues of the Sohrodinger equation (39) are known. 
If we write ^ ^ ^ ^ 

and substitute this wave function in equation (61), we obtain the 
equation ^ = 0. 

Multiplying both sides of this equation by the conjugate wave function 
and integrating over the space of all the coordinates—those describ- 
ing the radiation field as well as those defining the position of the 
electron — ^we obtain the system of differential equations 

fi„(0 = 12 (64) 

It follows from equation (63) that the quantity \bj,(t)\^ is the probability 
of finding the system in a state of energy at time t. 

Denoting by (n, 0) the state in which no radiation is present and the 
electron is in state and by (0, 1^) the state in which the electron is 
the ground state and a quantum hv^ is the only radiation present, 
equations (64) reduce to 


i „,«(0 = |2 ' 

8 

^ (6. 0)6„ 


( 66 ) 


Initially, at time t = 0, 

KfiW = 1 , hi.(0) = 0 . ( 66 ) 

If we now make the substitution 


= (67 a) 

in equations (66), we obtain immediately by a simple integration 


n 27 rt(vQ— Vg)— y 


(676) 


As t -> 00 the exponential factor decreases rapidly to zero so that 

i (O.I.IS1KO) 

A 2wi(vo— »,)— y* 

The probability that, after a time f, long in comparison with the life- 
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time l/2y of the excited state of the atom, a quantum of radiation h, 
has been emitted is therefore 

Multiplying by hv^ and the density function p,, integrating over all 
directions of propagation and summing over both directions of polariza- 
tion we obtain for the intensity distribution 

Now the total probability of a spontaneous transition per unit time is 
' = f J/v(».0|Sil0.1/d£l, 
lo 


W : 


SO that 


I(v). 


w 


27 t 47r2(v-Vo)2+y2 


(68) 

(69) 


where we have written Iq for hvQ. 

We have yet to determine the constant y. Substituting from equa- 
tion (67 a) into the first equation of the set (66), we obtain the relation 

1 ^ (to,0|6i|0,1,)» ^ 

ft® ^ 2 jr»(vo— V,)— 

\ 

We now replace the summation with respect to s by an integral over 
V,, first multiplying the integrand by py^dwd^l. We then have to 
evaluate an integral of the type 

, 1 - 


j 


^ .-exp{^(v,-v,)-^y}l 

27n(vo— V,)— y 


' which reduces to if(vo). It then follows immediately that 

y = Jw, 

where w is given by equation (68). The formula (69) then reduces to 


n \ ^ 


(70) 


The form of the distribution function I{v) is shown in Fig. 63. I(v) has 
its maximum at v — when its value is 

J(Vo) = 2IJ7TW. 

It should also be observed that 






The fbnBula (70) is idmrtioal with that deduced in the olageioal tbecoy 
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for exponentially damped harmonic oscillators; in the classical result 
the formula (68) for w is replaced by 

w = STT^e^/Smc®. 

Although the intensity of the line is measured by the total area under 
the curve, a reasonably accurate estimate of the intensity is given by 




the height of the peak. In actual practice it usually happens that there 
are several components of different intensities very close to each other. 
If the half-widths of the lines are comparable with the separation we 
get a distorted curve, in which it is difficult to distinguish the individual 
components; for example. Fig. 64 shows the intensity curve of the 
•£r^-Une M measured by Hansen. f In this instance the experimental 
curve is composed of three component curves of which the central is 
almost completely overlapped by the other two. 

We have only considered the broadening of spectral lines caused by 

t G. Hansen, Ann. der Phy§ik, 78, 668 (1926). 
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the damping due to the emission of the radiation itself. In addition 
to this there are several other causes contributing to the broadening 
of a line: 

(a) The Doppler effect due to the fact that the radiating atoms are 
not at rest.f 

(b) The effect of collisions of excited atoms with neighbouring 
atoms.l 

(c) Interactions of excited atoms with neighbouring atoms leading 
to a shift or splitting of the excited states.§ 

We shall not consider these effects here; for accounts of them the 
reader is referred to the papers cited. 

49. Radiation processes of higher order 

In the radiation processes considered above it has been possible to 
obtain the transition probabilities by calculating the matrix element 
(n\!^i\m) for the transition from an initial state m to a final state n, 
Si denotes the interaction between the electron and the radiation field, 
and TJrt are eigenfunctions of the wave equation (39). In these 
problems it so happened that these matrix elements did not all vanish. 
In many problems the matrix elements vanish for direct transitions 
from the initial to the final state. When this is the case it is necessary 
to improve the method of approximation. As in the case of second- 
order perturbation theory (Chap. Ill above), we have to repeat the 
calculations, retaining terms of a higher order. The inclusion of these 
higher order tenns is equivalent to assuming the existence of ‘inter- 
mediate’ states n' such that (n\^i\n') and (w'|Si|m) do not vanish 
although (nlSilm) does. 

Thus in the case of scattering by an atom when a quantum ko is 
absorbed and a secondary quantum k is emitted we have two such 
intermediate states. In the first ko has been absorbed and k is not yet 
emitted; in the second k has been emitted and ko is not yet absorbed. 

In the general case, let us denote the initial state by no, the final 
state by n, and the intermediate states by n'. Then assuming a solution 
of the,wave equation (61), in a series of the form (63), we have initially 

KW = 1. 

and all the other 6’s zero. 

t M. N. Saha and B. N. Srivastava, A Treatise on Heat, 2ad ed., Appendix 4 
(AUahabad, 1036). 

X J. H. Van Vleok and V. F, Weisskopf, Bev, Mod, Phys, 17, 227 (1946) ; H. FrChlich, 
Nature <1946). 

I y. Weiadcopf, Phys, Zeits. 34, 1 (1933); Obwrvatory, No. 713 (1933). 
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Thus, as a first approximation, we may take 6^, to be unity on the 
right-hand side of equation (64), and all the b^(t) zero except the 6^*. 
In this way we obtain the system of differential equations: 

= (71) 


n' 


(72) 


Integrating equation (71) we obtain 

M«) = (w1$iK) - ^-2# > (73) 

^no "n' 

the condition b^>(0) = 0 being satisfied. Substituting from (73) into 
(72) we derive the differential equation 


for the determination of b^. Integrating this equation we obtain 


which may be written in the form 


Kit) = 


w —W 



(«l6iK)(»'l6iNo) 

iw.-w,m^-w„) 




with 


Z: _ V («ISlK)(»'l§ll«c) 

~ Z w-w, ’ 


(74) 


If we form |6„(0 1* and neglect terms containing W^—W^' sinoe they play 
no role in this case, we obtain 


1M<)1* = 


2||ii; 


iWn,-W„)^ 


1— COS^-^ 




h 


(76) 


If, as before, we assume that in the neighbourhood of the final state 
there is a large number of states with energy between W and W+dW^ 
then by means of equation (76) we obtain for the transition probability 
per unit time 9 ^ 

«' = ?Ppr.l5il*. (76) 


For radiation processes for which both the matrix element (»oISil») 

and the sum T in\Sfi\n')(n'\t>iK)l{W„-W„.) yanish, it is necessary to 

w 

assume that two successive intermediate states n' and n'' exist for 
wn T 
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wluch and {n''\^^\n) are different frotn zero. The 

transition probabihty is again given by equation (76) except that now 
5i is defined by the equation 

The transition probability per unit time for the simple types of radia- 
tion problem considered in § 48 is, of course, also given by equation (76), 

but now 2 - / I/- , V 

Di = KISil^)- (78) 


Thus we may classify radiation processes according to the type of 
formula used for the calculation of the quantity Hi occurring in the 
formula (76). The radiation process is said to be of the first order if Hi 
is given by equation (78), of the second order if it is given by equation 
(74), of the third order if it is given by (77), and so on for higher orders. 
Furthermore, it is evident from equation (78) that and consequently 
the matrix element (^olSil^)? is of the first order in e, so that for a 
radiation process of the first order, the transition probability per unit 
time, w, is of order e^. Similarly for radiation processes of the second 
order w is of order e^, and for those of the third order w; is of order e®. 

Examples of radiation processes of the first order have already been 
considered above (§48). As an example of a radiation process of the 
second order we may cite the Compton effect. In this case the initial 
state consists of a quantum with momentum kg (Ikpl = hvQjc) and a free 
electron at rest with energy Wq = in the final state we have a 
quantum of momentum k and an electron with momentum k^—k and 
energy Wo+A(vo—v). The only possible intermediate states are: 

(a) One in which the electron has momentum ko and there is no 
light quantum present. The quantum of light has been absorbed 
by the electron; in the transition to the final state the electron 
emits a quantum of momentum k. 

(b) Another in which the electron has momentum — k and there are 
two light quanta with momenta ko and k present. The electron 
has emitted a quantum of momentum k; in the transition to the 
final state it absorbs the quantum with momentum ko. 

It is ij^ssible to calculate the matrix elements corresponding to each 
of the transitions and from them to calculate the differential cross- 
section for the scattering process.f Other examples of radiation pro- 
cesses of the second order are: 

(i) The coherent scattering of light. 


t W. Heitler, The Quantum Theory of Badiationt 2nd od. § 16. Cf. Chap. XI. 
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(ii) The Hainan effect. 

(iii) Resonance fluorescence. 

(iv) ‘Two qfianta’ annihilation of positrons. 

Among processes of the third order the most important is ‘Bremsstrah- 
lung’ — ^the emission of radiation by an electron passing through the 
field of a nucleus; others include double scattering and the creation of 
positron»electron pairs in the field of a nucleus. 

It can be shown that the application of the quantum theory of 
radiation to aU of these problemsf leads to reasonable results if the 
interaction, between the radiation field and an electron is con- 
sidered as small and the solution only worked out to the first non- 
vanishing approximation in If, however, the higher approximations 
in §1 are considered the result always diverges, so that the higher 
approximations have no physical meaning; thus it is found that the 
second approximation of the interaction energy between an electron 
and the radiation field diverges even if there are no light quanta present 
in the field. This energy is known as the transverse self-energy of the 
electron. The theory we have developed cannot therefore be regarded 
as exact. Recently, however, Heitler and Pengt have shown how the 
theory may be formulated to eliminate divergencies of this type. 

50. Radiation damping 

Introduction, It is well known that in considering the motion of an 
electron in classical electrodynamics it is necessary to consider the 
reaction of its field on its motion. The case of electromagnetic radiation 
in interaction with an electron is of interest in many applications, but 
the discovery of the meson in cosmic-ray experiments and the formula- 
tion of meson theories of nuclear forces has emphasized anew the im- 
portance of considering ‘damping' effects in radiation processes. 

50.1. Classical theory of radiation damjping 

The effect of radiation damping in the classical theory of radiating 
electrons was first considered by Lorentz, who used as a model for the 
dectron a small sphere charged with electricity and hence possessing 
mass on account of the energy of the electric field surrounding it. 
Lorentz showed that the retardation of the electromagnetic field inside 
an electron produces a force which is equal to 

Je^v/c*. (79) 

This expression is only a first approximation to the correct result for 
t See p. 287 below. t ^ P* ^^1 below. 
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f ’’ A, 

an electron of finite radius, being the first term of an expansion in 
ascending powers of rQvjCy where v is the frequency and Tq the radius 
of the classical electron. The form of the higher terms of the series 
depends upon the nature of the structure of the electron — ^i.e. on the 
nature of the forces that hold the charge on the electron — and there- 
fore cannot be treated in the classical theory. The main reason for 
retaining the Lorentz model of the electron was that it gave rise to the 
concept that all mass is of electromagnetic origin. However, the 
discovery of the neutron has proved that an elementary particle can 
have mass which is not of electromagnetic origin and thus removed 
the main reason for retaining the finite radius of the electron in the 
Lorentz model. 

The main difficulty in returning to a model of the electron as a point 
charge is that, if we apply Maxwell’s theory, the field in the immediate 
neighbourhood of the point electron has an infinite energy. This diffi- 
culty may be resolved in either of two ways. It might be possible to 
devise a scheme of electrodynamics in which the field of a point charge 
is modified for distances smaller than a certain critical radius in such 
a way that the energy of the field becomes finite ; the field strength 
calculated from the modified theory would then be less than the classical 
value for distances smaller than the critical radius. A modification of 
the classical Maxwell equations of this type is the non-linear field theory 
of Bom and Infeldjf because of the non-linearity the equations of this 
theory are not easily handled and can be carried over into quantum 
mechanics only with great difficulty. 

Another way out of the difficulty is to consider that the classical 
value of the field strength as measured by the force on other particles 
is valid for all distances, but the particle’s own field does not contribute 
to its rest mass. This is the basis of Dirac’s classical ^theory of radiating 
electrons. J Dirac retains Maxwell’s theory to describe the field right 
up to the point-singularity which represents the. electron, and by sub- 
tracting the infinite self-energy of the point-electron from the total 
energy of the system in an invariant way obtaii^ a relativistic expres- 
sion for the force of retardation of the electromagnetic field inside an 
electron" which reduces to the Lorentz expression (79) in the non- 
relativistic approximation. 

Furthermore, Dirac brings forward evidence for believing that this 

t M. Bom, Proc. Roy. 8oc. A, 143 , 410 (1934); M. Bom and L. Infeld, ibid. 144 , 425 
( 1934) ; 147 , 522 (1934) ; 150 , 141 (1935). 

t P. A. M. Dirao, ibid. 167, 148 (1938). 
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expression is exact within the limits of the classical theory. The self- 
consistent scheme of equations set up by Dirac for the calculation of 
effects due to the interaction of electrons with radiation is identical 
with the set due to Lorentz. The two theories differ, however, in their 
interpretation of the finite size of the electron; in Dirac’s theory the 
finite size reappears in a new sense, the interior of the electron being 
a region of failure, not of the equations of the electromagnetic field, but 
of some of the elementary properties of space-time. 

For instance, let us consider the case of an electron initially at rest 
being set in motion by the passage over it of a pulse of radiation of 
infinitely short duration. If the pulse is moving along the 2 ^-axis but 
is polarized in the a;-direction the electric force may be represented by 

E==(kS(t-y)A0), 

where ik is a constant and 8 denotes the Dirac delta function. f If i; is 
so small that the electron acquires a velocity small in comparison with 
that of light we may use the non-relativistic equation of motion 

mv-^^V = F, (80) 

u C" 

which reduces in this case to 


xlr—x = Kh(t), (81) 

where r = 2e^l3mc^, k — Zc^kj2e^. 

The solution of equation (81) is readily found to be 
^ _ I (t < 0) 

\ kjm {t > 0). 

Thus for positive values of t the electron is moving with constant 
velocity ifc/m, as we would expect. There is, however, a departure from 
the ordinary ideas of causality; although the pulse does not act on the 
electron until the time ^ = 0 the electron acquires an acceleration hefort 
that time. It would appear that the electron knew in advance that 
the pulse was coming and acquired an acceleration just sufficient to 
balance the effect of the pulse when it does act. We can surmount this 
difficulty by assuming that the electron ha^ a finite radius of order 

CT = 2e*/3mc® 


t The Dirao delta function is defined to be zero when the argument is not zero and to be 
infinite when the argument is zero in such a way that the function and the axis enclose 


unit area, i.e. 


a(aj)«:0 8(a;) = oo (« « 0) 


CO 

J *{*)(&>- 1. 

— 00 
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inoe tho electron begins to acquire an acceleration when the pulse is 
listant from the centre of the electron by this amount. 

Suppose now that electromagnetic waves are being radiated from a 
)oint A and detected at a point B a distance x+y away; and that 
here is an electron at a point E on the line AB distant x from A, 
Chen if a pulse is emitted by ^ at ^ = 0, the electron will start radiating 
.t a time 

nd a signal will be detected at B at time 


(x+y)lc-T. 

f we denote by c' the equivalent velocity of light in the interior of 
he electron and by 6 the radius of the electron, we then have 

x-b , y-b ,26 x-\-y 

_ "T - 1 

c c c c 


irhence we obtain c' = c(l— ct/26)-i. 

Tow b is of order cr so that c' is of order 2c; that is, it is possible for a 
ignal to be transmitted through the interior of an electron with a 
’’elocity exceeding that of light — ^in contradiction to our ideas of the 
lature of space and time. 

The theory proposed by Dirac thus violates the postulates of the 
heory of relativity; for all that, the theory is constructed in such a 
ray as to be invariant under Lorentz transformations. 

The difficulty that the electromagnetic energy becomes in&aite in the 
leighbourhood of a poiht charge has been removed by Prycef by the 
ntroduction into the energy integral of an additive vector field, which 
lepends on the variables of the charge alone, so as to make the integral 
inite at the point charge. This additional field may be regarded as 
; negatively infinite energy having its origin in the charge itself. The 
procedures adopted by Dirac and Pryce are different, but both consist 
a adding terms to the usual expression for the field energy which make 
he total field energy of a point charge finite. ThaC this procedure is 
physically sensible and not merely a mathematical device was suggested 
py Bhabha and Corben,J who extended the method to give the com- 
plete classical theory of a spinning particle moving in aneleotromagnetic 
ield. 

t M. H. L. Pryce, Proe, Boy. 800 , A, 168, 389 (1938); see ako G. Wenteel, ZeUa.t 
*hy9ik, 86, 479, 635 (1933) ; 87, 726 (1934). 

t H. J, Bhabha and H. 0. Corben, Proo, Boy, Soo, A, 178, 278 (1941). 
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Dirac’s theory leads to the non-relativistio equations of motion 

»»t-gV:=F, (82) 

where F denotes the external force and dots denote differentiation with 
respect to L These equations differ from the usual Newtonian equations 
of motion mv = F 

in that they involve differential coefficients of the third order. To solve 
the system of equations completely for the space coordinates (x,y,z), 
we need to know their initial values and those of v and v, thus intro- 
ducing more arbitrary constants into the solution than are necessary 
to fix the actual motion. The solutions of the equations of motion fall 
into two types — ^physical and non-physical solutions — ^which must be 
distinguished. In solving particular problems the procedure is to write 
down all the solutions of the equations (82) and then determine which 
of them are physically possible. For example, in the case of an electron 
moving in the absence of an external field, there are two solutionsf of 
the relativistic equations, one in which the velocity of the electron 
tends to that of light and a second in which the electron moves with 
uniform velocity; the former solution is non-physical, the latter physical. 
The solutions corresponding to the motion of the electron of the 
hydrogen atom have been discussed by Eliezer,J who shows that there 
is no solution in which the electron falls into the nucleus. Eliezer has 
also considered the effect of radiation damping on the motion in a plane 
of an electron in a uniform magnetic field§; the physical motion is one 
in which the electron spirals round inwards until it comes to rest, but 
there is also a non-physical solution in which the electron spirals out- 
wards with increasing velocity. 

As an example of the use of equations (82) we shall consider the effect 
of radiation damping on the scattering of light by a free electron, 
A light wave with the frequency v and the field strength 

E = 

may fall upon a fi^ electron which is initially at rest. The equation 
of motion (82) then becomes 

= eEoe**^*^, (83) 

where, for convenience, we have written 

y = 2c*/3c». (84) 

' t loo. oit., p. 166. 

t C. J. Elieaer, Proc. Oatuft. PhU. Soc. 39. 178 (1943). 


§ Ibid. 42 . 40 (1946). 
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Equation (83) has the periodic solution 


X = 


where 
so that 


-m(277v)*Xo+t(27rv)*yXo = eEg, 
Xa = 


477V(wi — 2i77vy) 
The total energy radiated per second is now 


5 — ? = — IE 

3 c8 3c3' m^(l+K^y 


where 


27ryv __ 47re^ 


m 3mc® 

and the total cross-sectionf for scattering is thus 

1 




3 [mcy l+zc® 
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50.2. Qmntum theory of radiation damping 

The incorporation into the quantum theory of the classical methods 
of Lorentz and Dirac has recently been discussed by several authors. 
The first quantum treatment of the effect of radiation damping on the 
scattering of light by free electrons was given by Waller, { but it was 
only by ignoring the initial conditions which the solutions of the per- 
turbation equations must satisfy that he was able to derive a solution 
of the equations. Waller’s theory was developed further by Gora,§ 
who overcame the difficulty arising from the initial conditions by 
assuming that the interaction, between an electron and the radiation 
field, is switched on adiabatically. Thus we assume that JTi == 0 initially 
and is switched on over a time period large compared with the period 
of the light wave. Gora’s results are also incorrect because of the 
omission of some of the terms in Waller’s equations. The correct 
equations to include the effect of radiation damping for the special 
case of scattering were derived simultaneously by Heitler|| and Wilsonft 
by entirely independent methods; Heitler adopted Gora’s assumption 
of adiabatic switching on of the perturbation ener^, and Wilson em- 
ployed a limiting process to eliminate the terms which make the 
solution of the problem difficult. A divergence-free quantum theory 
of rai^tion including the effects of radiation damping was formulated 

t Of. H. Casimir, Zetto./. Phyaih 81, 496 (1038). 

t I. Waller, ibid. 88, 436 (1934). 

§ E. Gora, Acta Phys. Pohnicoy 7, 169, 374 (1938-9). 

H W. Heitler, Proc. Comb. Phil. 8oe, 37, 291 ()941). 
ft A. H. Wilson, ibid. 37, 801 ^1941). 
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subsequently by Heitler and Peng.f The theory prbposed by these 
authors is frankly heuristic; they arrive at a divergence-free formation 
by simply omitting the divergent integrals from the original theory. 
This formalism is then shown to be Lorentz invariant. A general pro- 
cedure for this omission has so far only been formulated for one class 
of problems, the calculation of transition probabilities. Despite the 
heuristic basis of the Heitler-Peng theory its authors claim that, in view 
of its close agreement with experimental results, it will form a correct 
part of future quantum electrodynamics or a good approximation, 
having perhaps a similar relation to future quantum theory as the 
Bohr theory had to wave mechanics. 

Independently of these investigations DiracJ made an attempt to 
produce a new method of field quantization which would remove the 
divergence difficulties from quantum electrodynamics. By introducing 
the idea of quanta of negative energy and making use of a certain 
limiting process, Dirac showed that in his theory the integrals corre- 
sponding to the divergent integrals of the previous theories are con- 
vergent. Dirac’s theory has certain features in common with the 
Heitler-Peng theory; m the limit of quanta of low energy both theories 
reduce to the classical theory of radiating electrons due to Dirac.§ The 
relation between the two theories has been discussed by Gk)rmley and 
Heitler,|| who find that, contrary to expectation, the Heitler-Peng 
theory is not a consequence of Dirac’s quantum electrod3mamics. It is 
found that for processes involving the scattering of high-energy quanta 
the two theories give rise to very difiPerent results, and that, in certain 
cases, Dirac’s theory has no solution at all whereas the Heitler-Peng 
theory has. This last difficulty arises from the quanta of negative 
energy, and it would seem that this part, at least, of Dirac’s theory 
has to be abandoned. 

No completely satisfactory basis for the subtraction process employed 
by Heitler and Peng has yet been found, but numerous applications 
have led to results in agreement with experiment. As this theory is 
discussed at some length in the last chapter of Heitler ’s bookff we shall, 
in the following sections, mainly follow the method of treating radiation 
damping developed by Wilson. The connexion of this method with the 

t W. Heitler and H. W. Peng, ibid. 38, 296 (1942). 

P. A. M. Dirac, Proc, Roy, Soc, A, 180, 1 (1942) ; Comm, Dublin Inat. Adv, StudieSt A, 
No. 1 (1943). Of. W. Pauli, Rev, Mod, Phye, 15, 176 (1948). 

§ Of. last section. 

li P. G. Gormley ai^d W. Heitler, Proc. Roy,Ifi8h Acad, 50, A, 29 (1944). 
ft W. Heitler, TM Quoftfum Theory of RadMH¥mf 2nd ed. {Oxford, 1942). 



§50 


282 INTEBACTION Of RAlJl^aN WITB MATTER 

physical concept of 'damping* is not immediately obvious; it would 
seem that Wilson’s method consists merely of a more accurate solution 
of the wave equation for a Hamiltonian involving the electron and the 
radiation field. The relation of this process to radiation damping is 
purely formal — ^the mathematical analysis cotresponding closely to the 
subtraction method of Dirac for the classical theory. 


50.3. Solution of the radiation eqmtiona for direct transitions 
We first of all consider the case in which the transition between the 
initial and final states takes place directly, and not through inter- 
mediate states. Let n^ denote the initial state of the system, and n 
one of the possible final states of the system in which a quantum of 
radiation of any frequency, polarization, and direction of propagation 
has been emitted, such that the energy of the complete system is 
approximately conserved. As in §48.3 we have to solve the wave 
equation 

of the coupled system where is the sum of the Hamiltonians of the 
pure radiation field and of an electron moving in a field free from 
radiation and is the energy of interaction between the electron and 
the radiation field. We assume again a solution of the form (63) where 
the Wjc are the eigenfunctions and eigenvalues of the Schrodinger 
equation So'** = 

It follows immediately from equation (64) that the perturbation equa- 
tions for the probability amplitudes hjf) are 

= ( 86 ) 

n 


( 87 ^ 

The initial conditions satisfied by the probability amplitudes are 

M0) = 0. (88) 

If we write the solutions of equations (86) and (87) in the form 


e-y*, 6„(«) = 


W^-W,+i}iv ’ 


we see that the initial conditions (88) are automatically satisfied. The 
function is a function of the variables defining the final state; it is 
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assumed to be a slowly varying function of which is neither infinite 
nor zero when If we substitute for in equation (87) 

we obtain immediately 

««(<) = 

As before we replace the summation by an inte^al over Denoting 
by the number of states with energy between Wf^ and 

Wn+dWn for which the quantum of radiation is scattered into the solid 
angle the sum is then replaced by 




f 


w 

•'na 




'm 

Substituting for the sum in (89) and for 6„(i) by its value (88 a) we 
obtain finally 

Un = («ISil«o)+i’^[ / • (90) 

Equation (90) is an integral equation for the determination of the 
function I7„. 

We have next to determine the parameter y. Substituting from 
equation (88 a) into equation (86) we obtain 

Replacing the sum by an integral as before we have 

y = lj {%\^\n)UnPWn^n- (91) 

As an example of the solution of equation (90) we consider the^ 
scattering of light by free electrons. If we denote by Uq the initial state 
of the whole system (i.e. electron+ quantum) and its final state by n, 
then the matrix element (no|§il^) is given by equation (49a). Substi- 
tuting the Schrodinger wave functions for a free electron into (49 a) 


we obtain 


(»ol$iW = ^(eoe), 

mv 


where, as before, Bq and e denote unit vectors in the initial and final 
directions of the polarization respectively. As a trial solution we put 

= P(eoe), 
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then 


where 


U 


mv 


\-iK 
e\ J(eoe')(e'e)(la>' 


2mc* 

Now it is readily shown that 


(eoC) 


/ 


Stt , 


(eoe')(e'e)rfw' = y (eoe), 


so that K = Integrating over all directions of the emitted 

quantum and summing over the two polarizations we obtain the 

for the total cross-section of scattering, in agreement with the classical 
equation (85). 


50 . 4 . Tranaitions through intermediate states 

If in considering the scattering of light by free electrons we use 
Dirac’s relativistic equation for the motion of an electron, we find that 
the transition between the initial and final states cannot take place 
directly but must take place through intermediate states. We shall 
now derive the integral equations corresponding to equation (90) for 
transitions of this type. As before, let us denote the initial and final 
states by and n\ and let us denote an intermediate state attainable 
from the ground state of the atom by the absorption of a light quantum 
with a given frequency, polarization, and direction by m. The per- 
turbation equations then take the form 

= 1 {no\i>i\m)bJt)em,-w^n^ (92) 

m 

-mjt) = 2 (m|Si|n)6„(«)e«^«™. (93) 

= ” ( 94 ) 

m 

In these equations and are approximately equal, but may be 
very different from the energy of the intermediate state; the inter- 
mediate states are such that momentum is approximately conserved. 
The probability amplitudes \{t) again satisfy the initial condi- 
tions (88) and in addition 6„(0) = 0. Substituting from equations (88 a), 
the solution of (93) satisfying this initial oondition found to be 

= j(»»l&K)+»w[ J 

( 96 ) 
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Substituting &om (88 a) and (96) in (94) we obtain 


= 2 (»ISil»»)((»»l$il«o)+»’r[ f (m\^i\n')U^.pwid<oJ \ X 


e-y<+Wn«-W/«— 

Wn-W^+ihy 


(96) 


Let us now consider in particular the scattering of light by a free 
electron. If a free electron absorbs a quantum the resulting inter- 
mediate state mo is sharply defined by the law of conservation of 
momentum and the sum with respect to m reduces to a single term — 
that with m replaced by mo- The resulting equation has no solution 
since there is no term on the left which we can equate to the term with 
factor on the right. Instead of considering the electron to 

be free we consider it to be bound, the binding energy of the electron 
being small in comparison with the energy of the quantum of radia- 
tion. There is then a continuum of intermediate states and it is possible 
to make the corresponding terms involving on the right 

of equation (96) give a zero contribution to the final result. To 
demonstrate this we have merely to observe that the matrix elements 
(no|S|m) are not delta functions — although is— so that the 

summation with respect to m gives zero for sufficiently large values of f , 
since is a rapidly varying quantity and the denominator 

does not vanish at any point where the matrix elements are not zero. 
In this way we can equate the coefficients of and then 

pass to the limit of a free electron by replacing the sum by a single 
term. In this way we obtain the equation 


/ la I u i<x u \ (»l$lK)f 

„ n Si|TOo)(molSi|Wo) , y 




(97) 


for the determination of the unknown function U^, 

This procedure gives us the formulae for the scattering of radiation 
by a free electron initially at rest; by applying a Lorentz transformation 
we obtain the results for the case in which the electron is in motion 
originally. The method outlined above seems at first sight rather 
art^cial, but, as remarked by Wilson, it is in fact closely analogous 
to the procedure by which the scattering of an electron by a Coulomb 
field is derived from Bom’s scattering formula by calculating the scat- 
tering produced by a screened nucleus and then proceeding to the limit 


ofzerosoreenmg. 
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To determine the parameter y of equation (97) we substitute from 
equations (88 a) and (96) into equation (92) to obtain 

Eliminating the terms involving the factor and proceeding 

to the limit of zero binding we obtain 

% r r 1 

Wn,-W^+Hiy W„-W^,+iliy 

If there is more than one intermediate state in the case of zero 
binding the analysis proceeds along similar lines. 

The y*s in the denominators of equation (97) can usually be neglected 


and we obtain an 

integral equation of the type 

= U^^+iTT J K(n,n')U^,p^^,d(o^., 

(98) 

where 

1-70 _ V («iSil»»)(»»ISil»o) 

w —w 

(99) 

and 


(100) 


It is possible to investigate the influence of radiation damping on 
scattering processes by a more conventional method, namely, the 
stationary method of the perturbation theory. Ma and Hsiieh have 
shownf that this method gives rise to the integral equations derived 
by Heitler and Wilson for two-stage transitions when the effect of 
radiation damping is included in the analysis. 

50.6. Solutions of the integral equations 

Because of the complicated nature of the kernel of the integral 
equation it seems hopeless to attempt to derive a general solution. 
Some progress has, however, been made with the solution of the integral 
equations for certain special problems. The influence of radiation 
damping oix the formula for the scattering of radiation by &ee electrons 
has been investigated by Wilson,} using Dirac’s relativistic wave 
equation for the electron. Wilson considers the scattering in the frame 
of reference in which the total linear momentum of the system is zero; 

t S. T. Ma and 0. F. HsUeh, Proc, 0<mb, PhU, Soc, 40, 168 (1944). See also W. PaiiU, 
Meaon Theory of Nuclear ForeeSf chapter iv (New York, 1946). 

} A. H. Wilson, loo. oit., p. 807. 
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the kernel does not then depend too strongly on the angle of scattering 
and we can replace it by an average value. Thus equation (98) is 
replaced by the equation 

where the bar denotes a mean value of the nature of a root mean 
square. Hence 






l+ 16 ,r*|C?«|VV. 

With this approximation Wilson obtains the general formula 


_ d<^o 


( 101 ) 


for the differential cross-section, in terms of the differential cross- 
section for zero-damping, d<^o‘> initial momenta of the light quantum 
and the electron are p and — p. This formula is not exact; the second 
term in the denominator should be multiplied by a numerical factor 
of order unity to improve the approximate method of averaging. 

By means of this formula Wilson shows that the damping is given 
correctly by the classical theory for low frequencies; however, at high 
frequencies (^v > 137mc^) he finds that, contrary to classical expecta- 
tion, the influence of damping on the cross-section is always negligible.f 
Since all applications of radiation theory to high-energy problems of 
the type arising in the theory of cosmic rays depend on the smallness 
of the damping effect, this result is of considerable importance. Because 
of this, and the fact that Wilson’s procedure of averaging under the 
integral sign may not lead to correct results, the problem has been 
re-examined recently by Power; J expanding the solution of the integral 
equation in ascending powers of c®, Power shows that the extra term, 
due to damping, in the Klein-Nishina formula is small in comparison 
with the main term, thus confirming Wilson’s result. 

Approximate solutions of equations (90) and (97) have also been 
discussed by Hsueh and Ma§, by means of a variational method. 
Separating the summation over the directions of polarization from the 
other integration they write equation (90) in the form 

J ^ 


t The fonnula (Ellein-Nishina) for the cross-seotion without damping is disoussed 
on p. 316. 

t S. C. Power, Proc. Roy. Irish Acad. 50, A, 139 (1946). 
i 0. F. HsOeh and S. T. Ma, Phys. Bev. 67, 303 (1946). 



288 


INTERACTION OF EADIATION WITH MATTER 


§60 


and note that it is equivalent to the equation 

2 I f = 0, 

Pno P" •' ^ ^ 

where SU* denotes an arbitrary variation of the complex conjugate 
of Un* Substituting 

where a is a parameter independent of n and tIq and varying U*, we 
have 

f KISiWX 

Pn^Pn •' 

Solving this equation for a we obtain 

1 


1 * 2 

— %K 


where 


2 2 2 f f («olSil«)(»ISil»')(»'ISil«oW.piF.,<*“>«‘^»' 

Pm Pmf •> 




(noiSi|»){»|Si|noW,(Ja)„ 

D-- Pm J J 


Phq Pn 

The formula for the cross-section is now 




( 102 ) 


This formula is of the same form as Wilson^s result (101), but the 
damping factors in the two formulae are different. To test the validity 
of formula (102) it is of interest to use it to calculate the cross-section 
for the non-relativistio scattering of a quantum of radiation by an 
electron at rest and then to compare the result with the exact solution 
(86 a). Now from the equation 

(^ISiK) = ^(eoe) 


mv 


we have that 


V* (eoe)(ee')(e'eo)<?i>„(J«„- 

22 f (eoe)(ee„)da,„ 


and equation (102) then gives 

U - (^0^) 

^ mv 1— i#c 

in agreement with the exact result (85 a). 
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We have seen that the effects of radiation damping are unimportant 
for processes involving free electrons. On the other hand, it was found 
by Heitler and Wilson that damping effects are appreciable in all 
processes involving mesons. Since then there have been various investi- 
gations of the effect of damping in problems on the motion of mesons. 
Relativistic formulae for the scattering of mesons by nuclear particles 
under the influence of radiation damping have been derived by Ma and 
Hsueh.t The theory of radiation damping has also been applied to the 
creation of mesons by various processes. It was shown by Heitler and 
Pengt that in a collision between a fast nuclear particle and another 
nuclear particle at rest mesons are emitted with a very high probability; 
the production of mesons by light quanta has also been inve8tigated.§ 
These papers form the basis of a theory f, t which gives a satisfactory 
' account of all the chief cosmic-ray phenomena associated with mesons, 
including their creation, their diffusion through the atmosphere, meson 
showers, and the transformation of charged mesons into neutral mesons; 
it is assumed that the mesons are produced by a primary radiation of 
protons. The agreement with the experimental results is sufficiently 
close to make it seem probable that the theory of cosmic-ray mesons 
based on the quantum theory of radiation damping is fundamentally 
correct. 


t S. T. Ma, Proc. Camb. Phil. Soc. 39 , 168 (1943); S. T. Ma and C. F. Hsiieh, ibid. 

t W. Heito and H. W. Peng, Proc. Roy. Irifth Acad. 49 , A, 101 (1943) ; W. Heitler, 
ibid. 50 , A, 155 (1945) ; H. W. Peng, ibid. 49 , A, 245 (1944). 

§ J. Hamilton and H. W. Peng, ibid. 197 (1944); J. Hamilton, W. Heitler, and 
H. W, Peng, Phya. Rev. 64 , 78 (1943). 
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XI 

RELATIVISTIC QUANTUM MECHANICS 

51. Introduction 

The theory of the spinning electron developed by Pauli and Darwin 
(Chap. IV) oh the basis of the Schrodinger wave mechanics is incom- 
plete in two important respects. Though it provides a suitable mathe- 
matical scheme for the description of many of the properties of the 
electron’s spin deduced from spectroscopic observations, it gives no 
satisfactory account of its origin. Furthermore, no attempt was made 
to take account of the special theory of relativity, so that the theory 
cannot be applied to the discussion of problems involving electrons of 
high energy; and for many problems, such as the fine structure of the 
spectral lines of hydrogen, the corrections introduced by spin and by 
relativity are of the same order. Before the introduction of the concept 
of electron spin into wave mechanics various attemptsf were made to 
establish a wave equation of the Schrodinger type invariant under 
Lorentz transformations. Unlike the non-relativistic Schrodinger 
equation, these equations were of the second order in the time- 
differential operator djdt, Dirac solved both problems, that of con- 
structing a relativistically invariant wave equation and that of 
explaining the origin of the electron spin, by observing that if. the 
general interpretation of non-relativistic wave mechanics is to be pre- 
served, the relativistic wave equation must be linear in djdt. The 
beauty of Dirac’s method lies in the fact that'once a Lorentz-invariant 
wave equation, linear in the operator djdty has been constructed, the 
magnetic moment assumed in the model of the spinning electrons pro- 
posed by Uhlenbeok and Goudsmit appears as the result of a simple 
calculation without the addition of a new physical assumption. Dirac’s 
theory of the electron has met with conspicuous success in its applica- 
tion to the fine structure of atomic spectra, the scattering of fast 
electrona and of hard y-rays, and the formation df pairs of electrons 
and positrons. The agreement with the observed experimental results 
is so good that it appears that Dirac’s equation for a single electron is 
essentially correct. On the other hand, there is considerable doubt as 
to the validity of the extension of the theory to two electrons. 

Before considering in detail the possible forms of the relativistic 

t £. Sohrddmger, Ann, dtrPhyaik, 81, 109 (1926); 0. Klein, ZeUs.f. Phyaik, 37, 895 
(1926); V. Fook, ibid. 38, 242 (1926); ibid. 39, 226 (1926). 
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wave equation for an electron it will be of advantage to state the general 
conditions that we should expect such an equation to satisfy. The 
equation should satisfy the requirements of the special theory of rela- 
tivity, that is to say, that it should be invariant under the group of 
Lorentz transformations. There is another kind of invariance which 
the equation must possess; this is ‘gauge-invariance*, the importance 
of which was realized first by Hermann WeyLf In classical electro* 
dynamics the field strengths E and H are related to the scalar and 
vector potentials ^ and A by the formulae 

H = curl A, E = —grad — i 

c dt 


divA-J-i^ = 0. 
dt 

The transformation 


^ ^ec dt ' 


A = A' — grad A, 


where A is a solution of the wave equation 

1 ^ 
dt^ 


V2A == ^ — 


leaves the form of the above equations unaltered and is called a gauge 
transformation. 

We should also expect to be able to form a 4-vector representing 
current and charge density and to prove by means of the wave equa- 
tion that the current satisfies a continuity equation both in the presence 
and in the absence of an external electromagnetic field. (The analogous 
case in the non-relativistic theory is given on p. 269.) Finally, we 
should be able to predict the existence of the spin and of the magnetic 
moment of the electron from the wave equation. 

At first sight it vrould appear that it is necessary for the theory to give 
only positive values for the energy of the particle. We shall see that 
negative energy states of the electron arise in Dirac *s theory but that 
it is possible to make use of them to account for observed properties 
of positrons (positively charged particles with the electronic charge 
and mass). Though Dirac’s theory had this unexpected triumph of 
identifying the positron with a ‘hole* in the aggregate of negative 
energy states, such a hypothesis, strictly speaking, removes from the 
theory its original basis as the theory of a singk electron. Indeed, if 

t H. Weyl, Chruppentheorie und Quantenmechanik (Leipzig, 19^). 
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we wish to obtain valid relativistic interpretations we cannot even 
properly formulate the problem of a single particle; we must instead 
study the many-particle problem from the beginning. In the many- 
particle problem in which the particles are considered to be moving 
at very high speeds the charge density — eP need not be negative (in 
the algebraic sense) because of the possibility of the creation and 
annihilation of positrons. In other words, the probability function P 
need not always be positive in the many-electron problem. 

52. Scalar relativistic wave equations 
52.1. The Hamiltonian for the relativistic motion of an electron 
In this section a derivation will be given of the Hamiltonian of a 
charged particle moving in an electromagnetic field. Consider the 
motion of an electron of rest mass m and charge —e in an arbitrary 
electromagn^ic field whose electric and magnetic vectors are denoted 
by E and H respectively. When the velocity of the electron is v its 
momentum is given by 

TflW 

4y^)’ 

so that the equations of motion are 

Now if we denote the scalar and vector potentials of the field by ^ 
and A, then 

E=-grad^-~, H = curlA, 


and the vector equation of motion of the electron (1) reduces to 

The equations of motion (2) may be written in the Lagrangian form 
di 


d l^I\ ^ . 


(3) 


if the Lagrangian function L is taken to be ^ 

L== — »icy(l— v*/c*)+€^ — -(v.A). 

c 

This may easily be verified. The conjugate momenta, ai^ defined by 
the lelations 

. % 
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SO that for the dynamical system described by the Lagrangian function 
(3) th© momenta are where the vector p is given in terms 

of the vectors v and A by the equation 


The general expression 


P = 


wiv 


V(1-V7C2) C 

H = lqiPi-L 


?A. 


( 4 ) 


for the Hamiltonian of a system whose Lagrangian function is L, 
reduces for a single particle to the simple form 

H = {p.\)-L. (6) 


Substituting from equations (3) and (4) into equation (5) we obtain 






Solving equation (4) for v and substituting in this last equation we 
find that in terms of the momentum the Hamiltonian of a single particle 
moving at speeds comparable with that of light is 

H = c-^{m2c2+(p-f cA/c)2}— e^. (6) 

For a conservative system the Hamiltonian, H, is equal to the energy 
of the system, which is denoted in this book by W. Writing = If/c, 
we see that equation (6) may be written in the symmetrical form 




52.2. Approximate relativistic equations 

The difficulty in setting up the relativistic Schrodinger equation 
directly from the Hamiltonian (6) arises from the interpretation of the 
square root. Approximate equations can, however, be derived by 
expanding the Hamiltonian H defined by equation (6) above in terms 
of vjc. In this way we obtain as a first approximation (cf. Chap. I, § 8) 

In the second approximation we replace equation (8) by 

If in this equation we write H — W'+mc^ and replace (p+©A/c)*/8w®c* 
by its first approximation (W'+e^)®/2mc* given by equation (8), we 
find that to the order 
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The Sohrodinger equations corresponding to these Hamiltonians are 
obtained in the usual way by making the substitutions 



in equations (8) and (9). This procedure applied to (8) yields the 
equation 

^|grad+^) jJgrmi+^jY+(mc*-e^)Y+J ^ = 0, 

which may be reduced to the form 

= ^^+mc»'F. (11) 


In this equation it should be noted that div A does not vanish unless 
the scalar potential is constant with time, since (j) and A are related 
by the equation , ^ . 


divA+-7 = 0. 
dt 


The equation ( 1 1 ) is correct only to the order vjc, where v is the velocity 
of the electron. 

It will be noticed that equation (11) differs from the non-relativistic 
equation used in § 8 only in the small term in A*, which can be neglected 
in most applications, and in the term moW, which can be removed by 
transforming to the new wave-function 

Y' = QimcHjhY, 

. If we multiply equation (11) by Y*, and its complex conjugate by 
Y and subtract, we obtain an equation which may be written in the 
form of a continuity equation 

l+divj = 0, (12) 

if p and j aie taken to be defined by ^ 

p = -eW* 

J = „4^(Y*gTad Y-Y grad Y*) - — AYY* 

2%m tnc 

Similar results may be obtained from equation (8) giving a betto 
approximation. 
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52.3. The Kkin-Oordon Eqmticm 

It was suggested by Gordonf that the exact relativistic wave equa- 
tion should be established by making the substitutions (10), not in 
equation (6), but in the more symmetrical equation (7) and allowing the 
operator so formed to act on a wave-function T. In this way we obtain 
the relativistic wave equation 


This equation is known as the Klein-Gordon equation. 

The Klein-Gordon equation involves us in serious difficulties in 
interpretation which arise because it is not linear in the time-differential 
operator djdt. In non-relativistic quantum mechanics the Schrodinger 

e,-i« («+5|)'>' = 0 


is linear in djdt. It follows that if T* is known initially, the equation will 
determine it at all subsequent times. The results of any measurements 
on the position and momentum of a particle are used in forming the 
wave-function T; the correct initial condition is thus that T should 
be known, not Y and dWjdL With the Klein-Gordon equation, on the 
other hand, it would be necessary to know both T and dYjdt in order 
to determine T at all subsequent times. 

The second difficulty arises when we attempt to find an expression 
for the current and charge density. Multiplying equation (14) by T*, 
the complex conjugate equation by Y, and subtracting we find that 
the resulting equation may be written in the form (12), where now 


1 = -eP = : 


mc^ 


fh #>2 ' ' 

j = gradT*} - — ATT* 

2m^ ' ^ me ) 

These expressions for the charge density and the current density are 

due to Gordon and Klein. { The difficulty arises from the interpretation 

of the quantity P defined by the first equation of (16). In the non- 

relativistic Schrodinger theory P would be the probability function for 

the position of an electron. Such a probability function satisfies two 

general conditions: 


P>0. 


'.jpir.O. 


t W, Gordon, Zcita.f. Physik^ 40, 117 (1926). 

I W. Gordon, ibid. 40, 121 (1926); 0. Klein, ibid. 41, 414 (1927). 
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the integral being taken over all space. In a completely satisfactory 
relativistic theory we should expect P to satisfy both of these conditions 
together with the further condition that 

jpi, 

is invariant under Lorentz ^transformations. We have already shown 
that the condition (ii) is satisfied by showing that p satisfies a continuity 
equation of the type (12). The third condition is satisfied automatically 
if it can be shown that p may be expressed as the fourth component 
of a 4-vector. This is readily done by considering the transformation 
properties of the set of four quantities 

The condition (i), however, is not satisfied, since 




2mcH\ dt dtj'mc^ 


and this can be made negative, both T and dY/dt being arbitrary at 
the initial point in time. P is not therefore a satisfactory expression 
to replace the probability function TT* of the non-relativistic theory. 

Although the expression for P given by equation (15) seems at first 
sight to be totally different from the non-relativistic formula TT*, it 
can easily be seen that the formula derived from the Klein-Gordon 
equation reduces to that given by the Schrodinger equation when the 
kinetic energy of the particle is very much smaller than its rest energy. 
For if we substitute ^ 


in the first of equations (15), we obtain 

(17) 

^From the fact that W — W'+mc^ it follows immediately that 

P=YY*(l + ^^]. (18) 

\ mc^ I 

If now the kinetic energy is assumed to be small ki comparison with 
the rest energy me* we find that P is approximately equal to YY* as 
in the non-relativistic theory. 

53. The Dirac wave equation for an electron 
53.1. TJ^ Dirac equatim for an arbitrary field , 

The considerations of the last section show that the correct relativistic 
wave equation must be linear in the differential operator djdt] further- 



§ 63 THE DIRAC WAVE EQUATION FOR AN ELECTRON 297 

more, in conformity with the special theory of relativity, the equation 
must be invariant under a Loreutz transformation. In the theory of 
relativity there is complete symmetry between the space coordinates 
X , !/, z and t^ time coordinate kt, so that if our equation is linear in 
djdt it must also be linear in didx, djdy, and djdz. For this reason 
Diracf took the relativistic wave equation to be of the form 

(19) 

where, as in the last section, ^ denotes the scalar potential and 
{Aj,y AyyAg) the vector potential; the operators Pa.,Py, Ps are defined by 
equation ( 10 ) and p^ denotes the operator (ihlc)(dldt). In equation (19) 
the quantities oL^y oty, a^, jS are not ordinary numbers; their nature will be 
defined below. To determine the laws governing these quantities Dirac 
made the additional assumption that for an electron in the absence of 
a field the wave equation (19) would reduce to the Klein-Gordon equa- 
tion (14). In the absence of an electromagnetic field both and A 
vanish and equation (19) reduces to 

Multiplying this equation by the operator 

Po~\^^xPx~^^yPy~^'^zPz'^^> 
we obtain the second-order wave equation 

{-^0+ 2 '4pI+ 2 2 (“x^+^“x))^ = 0- 

' xyz xyz xyz 

This reduces, to the Klein-Gordon equation for a free particle 
(-J’o+;>J+i>5+rf+wi2c2)T = 0, 
if the quantities a^, a^, jS obey the relations 

^ = af = 1 , = 0 , etc., | 

jS* = a^iS+iSa* = 0 , etc. I 

Once the quantities a^., 04 ,, a^, and j3 have been determined firom these 
equations, the wave equation for an electron in an arbitrary electro- 
magnetic field can be taken to be (19). It should be noted that the 
wave-function T appearing in this equation need not be a scalar; we 
shall see later that, because of the nature of the and jS, the wave- 
function T must, in fact, be assumed to have four components. 

f P. A. M. Dirac, Proc. Boy, Soc. A, 117, 610 (1926). 
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The Dirac equation (19) assumes a convenient form if we write 
(x,y,z,ict) = 

(•^x> (^1) ^ 4 )* 

= (0:1,012,0(3, 0:4), 

and put = (j=l,2,3,4). ' 

t OXj c 

With this notation equation (19) may be written in the form 

{ 2 o(^n^— *114+0(4 twcIt = 0, ■ (21) 

'i*! ' 

where now the quantities ag, ag, oc^ satisfy the relations 

“A“v+“v«A = 28 ai. (22) 

(A, V = 1,2, 3, 4). If we write the three quantities (a^, ag) as a vector 
a and introduce a vector operator 11 in the same way, we may write 
equation (21) in the form 

(a.n— in4+(3£4?wc)T = 0. 

It is sometimes more convenient to write the Dirac equation (21) in 
a more symmetrical form by introducing the operators 

ft = ta4ai, ^2 = ^*“4«2» = ^4 = “4* (23) 

Multiplying equation (21) on the left by ioL^ and substituting from 
equation (23) we obtain the equation 

= 0. (24) 

The relations (23) give the operators in terms of the operators a^; 
the inverse relations can be readily shown to be 

®1 ~ “a ~ %^4> “8 = %^4» “4 = A* (25) 


53.2. The Dirac matrices 


So far the only information we possess about the quantities oc^ 
(j = 1,2, 3, 4) is that they satisfy the equations (22). From these 
equations we now attempt to obtain a matrix represwitation of these 
quantities. The equations (22) are, of course, satisfied by the Pauli 
spin operators era., <Ty, o’, introduced on p. 97, but if we choose these 
operators ab three of our a’s we find it impossible to determine the 
fourth. If, however, we extend the Pauli matrices in a diagonal manner 


(<7ij OTg, CTg) 



(26) 
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where o denotes the symholio vector (a^, and if we introduce the 
matrices /o 0 1 0\ /O 0 -t 0 \ 

_/0001| _ 0 0 0 -i\ 

li 0 0 or ^ li 0 0 or 

\o 1 0 0/ \o i 0 0 /• 


\0 0 . 0 - 1 / 

we find that the a’s and p’s satisfy the relations 

(T,(T,+(T,(7, = prPs+PsPr = 28„ I 
Pr<^l = <^lPr „ ' 

If now we define a set of quantities by the equations 

= Pi “2 = Pi ^2y “3 = Pi “4 = />3> (29) 

we find that all the conditions (22) are satisfied. Performing the matrix 
multiplications we obtain for the a’s the four matrices a 4 = and 
/o 0 0 i\ /o 0 0 -A 



0 

0 



0 

0 

0 

0 

1 

o\ 

'0 

0 

i 

“‘= 0 

1 

0 

0’ “* = 


— i 

0 

\l 

0 

0 

0/ 

\i 

0 

0 


From equation (23) we obtain for the |3’s the four matrices = ol^ = 
/ 0 0 0 t\ /O 0 0 1\ 

. I 0 0 i oi „ lo 0 -1 o\ 


0 0 -i 
0 0 0 


Furthermore, it can be proved that any four-rowed square matrix 
can be expressed as a linear combination of the sixteen matri(|es 

«!, a2» ®8 j Pv Ply Ply ^ 

otiptj, Ojaj, Ojai, ociagaa, cx2a8a4, oL^oLiOt^y ai«a«8«4» 



300 


RELATIVISTIC QUANTUM MECHANICS 


iS3 


where I denotes the unit matrix 

ri, 0 

0 1 

u I 

It should also be noted that all these matrices are unitary and 
Hermitian.f 

The following properties of the a and p matrices are of use in the 
application of the theoi^ to the solution of special problems. The first 

PiP2 = ^PiPi = j ^32) 

0’i0’2 — (72 O’j = ^^3! 

follows directly from the definitions of the quantities involved. Also 
if B is any Vector commuting with a, then by making use of equations 
(32) we can show that 

(a.B)2 = I (33) 

1,2,3 • s 

Substituting B=p+-A = ^ grad + - A 

Cl c 

Jnto this equation, we obtain the relation 

(ff.p-f^Aj = ^P+^aJ +j(a.curIA). 


(34) 


53.3. The relativistic invariance of the Dirac equations 
Since the quantities a and ^ may be represented by matrices of four 
rows and columns it follows that the wave-function Y upon which these 
quantities operate must itself be a column matrix with four elements; 
we shall write it in the form 




(36) 


With this form of W and the matrix representations of the j3’s given 
by the equations (31) we obtain by direct matrix multiplication 

%\ 7 %\ / Y.\' / %\ 








t If A is a matrix, A' its transposed (obtained by interchanging its rows and columns), 
its inverse, and A* the matrix whose elements are the complex oonjpgates of those 
of At then if A A*', A is said to be Hermitian and if ^ A is said to be 

see Turnbull and Aitken, Canonical Matricea (1932), p. 33. 
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Substituting these expressions into the left-hand side of equation (24) 
we obtain the Dirac equations in the following form: 


|ift^-e^+mc*jTi+cn_4;+cn3'n = 0 
|ifti_e^ + mc*J'F3+cn+’F3-cnsT4= 0 
Ji^i-e.^-wicsJ’fi+cn.Yj+cnsTi = 0 
U^-e<l,-mc^\%+cn^%-cYl^% = 0 


(36)„ 


(36) determines the four scalar wave-functions Tg, Tg, and In 
these equations we have written 0 +, n_ to denote the operators ITii Hg. 

In the form (36) the Dirac wave equations are unsymmetrical and 
so.it is necessary to study their behaviour under a rotation of axes or 
under a Lorentz transformation. The proof of the invariance of the 
Dirac equations under a general transformation of the type indicated 
was due originally to Dirac ;f an interesting proof based on the in- 
Sniteaimal elements of the Lorentz group was given subsequently by 
Pauli.} The general formulae are complicated, but it is a simple matter 
to verify the invariance for certain special transformations. 

We consider two observers moving with constant velocity relative 
to one another, one describing his observations of the behaviour 
of an electron in an electromagnetic field by a set of coordinates 
= ict) in a space of four dimensions, the other by a similar 
set of coordinates = ict'). Then according to the special 

theory of relativity the two coordinate systems are related by the 
matrix equation ^ ^ ( 37 ) 


th^it is, (i = ^ 

^*=1 

where the coefficients are constants. 

> The first observer will represent his knowledge of the coordinates 
of the electron by a wave function of the type (36) sc^tisfying the Dirac 
wave equation ^ 

(^2^^S,n,+mc)Y=0. (38) 

Writing (37) in the form x' = 


t P. A. M. Dirac, Proc. Roy, 80 c. A, 117, 610 (1928). 
J W. Pauli, Handbuch der Phyaik, 24/1. 
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and noting that 

dx'~' dx' dx'* dx ' dy' dx* 
we see that the transform according to the rule 

n = co-iR'. 

Now there are two possible ways in which we may discuss the trans- 
formation properties of the Dirac equation. We may regard the wave- 
function T as an invariant (T = T') and the j8^ as the components of 
a 4-vector, The components of this 4-vector are, of course, matrices. 
If j8 = transforms as a 4-vector it transforms like x so 

/sn = j3'a)co-in' = jS'n' 

and the Dirac equation is transformed to 

(2i3^n;+imc)T' = 0, (39) 

which is of the same form as equation (38). 

Alternatively, we can always find a matrix 8 such that 

n = co-in' = (40) 

If now the are regarded simply as determining the coefficients in 
the four wave equations (36) and are kept constant, they are exchange- 
able with 8 since they act on the inner variable only, so that 
(S“ii3n'S)T+imcT = 0. 

Multiplying on the left by 8 we obtain 

(pn'+imc)(8Y) = 0 , 

which is equivalent to equation (39) if we write 

T' = 8Y. 

This p^t of view must be adopted if we consider the Dirac equations 
as field equations, for then Y-must be transformed to a new coordinate 
system in a manner analogous to that in which E and H transform 
in the Maxwell theory. 

The form of the matrix S has been determined by Darwinf for some 
special two-dimensional linear transformations; the results are tabu- 
lated below; ^ 

(i) Bi)UUicn Ihrough an angle a about the z-axia 


/ cosa 

sinoi; 

0 

ov 


/e*«« 

0 

0 

® \ 

/ — sina 

C08CK 

0 

0 

l. ■ ^-1 

0 


0 

0 \ 

o 

li 

3 

0 

1 

01 

1 ° 

0 ' 


0 ) 

\ 0 

0 

0 

li 

r 1 

1 0 

0 

0 

e-H./ 


t C. 0, Darwin, Proc, Roy, 8oc, A, 118, 664 (1928). 
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(ii) BotaUon through an angle ol about the y-axis 

( cos a 0 — sina 0\ / cos}cif sin^^ot 0 ^ \ 

0 1 0 o\ /--sinja cosia 0 ^ | 

sincK 0 ooso£ ^ / i ^ ^ oos-Ja sin-|a( I* 

0 0 0 1/ \ 0 0 — sinia cosja/ 

(iii) Uniform relative motion ahng the z^axia * 

( 1 0 0 0 \ /coshijS 0 sinhJjS 0 \ 

01 0 coshJ|3 0 — sinhJjSj 

0 0 coshjS sinhjS r IsinhijS 0 coshJjS 0 I 

0 0 sinhjS cosh/S/ \ 0 -sinliJjS 0 coshi^/ 

These three transformations are all that need be considered since any 
Lorentz transformation can be built up from them. The matrices S 
have the peculiar characteristic that their elements involve the angle 
\ql w6en the axes of coordinates are rotated through an angle a, so that 
8 is somewhat analogous to the square root of cu. The wave-function 
T* cannot therefore be a tensor. We thus find that the Dirac equation 
of the electron is invariant under Lorentz transformation and yet it 
could only be written in tensor form in a highly artificial manner. 
This is an example of the mathematical principle that the ordinary 
tensor formalism does not comprise cdl possible representations of the 
Lorentz group. We shall return to this point later (§58 below); for the 
moment we merely observe that the four wave functions can be split 
into two pairs, for instance, ('ll, ^l^) and (T^, TIj) in case (i) above, with 
transformation equations of the form 

T, = al%+at,%y 

and with the coefficients ct^j satisfying the relation 

®11 ®12 _ Y 

®21 ^22 

53.4, The spin of the electron 
If in Dirac’s equatioh (21) we substitute for the a’s from equation 
(29), we obtain the wave equation of the electron in the form 

|po+^^+Pi|®-P+”-^j+P8^|^ = 

Multiplying on the left of this equation by the operator 


(41a) 

(415) 

(42) 
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proceeding as in §53.1, and making use of the relations (28), we obtain 
th^ equation , 


[ - {po+l^J + (® -P +^^)* + 


+ Pl 


The expression in the curly bracket may be reduced to 

where E is the electric vector of the field. Substituting for (o.p+eA/c) 
from equation (34) and writing 

H = curlA 


for the magnetic vector of the field, we obtain finally the wave-equation 


The Hamiltonian oa the left-hand side of this equation differs from 
that of the Klein-Gordon equation by the addition of two terms. 
Dividing these two extra terms by 2m we obtain an expression for the 
potential energy of the electron due to its spin. The fomier of the 
terms in this expression for the potential energy 


2mc 


(«.H) 


shows that the electron possesses a magnetic moment of amount 
€jSo/2mc due to its new degree of freedom. This is precisely the mag- 
netic moment assumed in the Uhlenbeck-Goudsmit model of the spin- 
ning electron; in Dirac’s theory the electron spin is a direct consequence 
of the form of the relativistic wave equation for the electron. 

The second additional term in the Hamiltonian shows that the 
electron behaves as though it had an electric moment iejto/2mc, which, 
hemg a pure imaginary, does not appear in the spinning electron model. 
This imaginary quantity only appears when the Hamiltonian (43), 
which is real, is transformed in a rather artificial/way so as to make 
it resembla the Hamiltonian of the Klein-Gordon theory, and does not 
appear to be capable of being incorporated into any model of the 
. deotron; 

We shall now determine the angular momentum operators associated 
ndth the Dirac equation; we shall assume that the operator representing 
the total angular momentum of the electron is the sum of the orbital 
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and the spin angular momentum operators, and that it is a constant 
of the motion; this means that it commutes with the Hamiltonian of 
the Dirac equation. If we take A to be zero in equation (43) and write 
= F(r), we see that the Hamiltonian reduces to 

Now the orbital angular momentum L is defined by the equation 

L = rAp, (46) 

and satisfies the relations 

I^xVv-PvI'x = i^Vz- (^®) 

hPz-PzI^x = i^Pv‘ 

Substituting for H from equation (44) we have 

L,H-HL^ = Pi{L^(o.p)-(o.p)4} 

= Pi{o-i'*P-P4} 

= ihpiiaiPz—a^Pi} 

by the use of equations (46). Similar equations may be derived for the 
components Ly and leading to the vector equation 

LH-NL = iftpi(oAp). (47) 

Similarly we have 

= Pi{<ri(o.p)— (o.p)<ri} 

= ‘i‘ipi[ozP%-°2P^ 

by the help of equations (32). Thus we obtain the equation 

oN-Na= -2ipi(oAp). (48) 

Eliminating the vector product (oAp) between equations (47) and 
(48) we obtain the equation 

iE-H3 = 0, (49) 

where J = ^+1^- 

Equation (49) establishes that J is a constant of the motion; we can 
interpret this result by postulating that the electron has a spin angular 
momentum defined by the equation 

s = 

or, in terms of the a’s, 

Sy. = (51) 

It follows at once from these equations that 

«| = «5 = 4 = (P)*. 

X 
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in agreement with the spin postulated by Uhlenbeck and Goudsmit, 
and that the operators satisfy the formal equations 

— -“^v^x = ¥^z ^ 

of the Pauli-Darwin theory. 

The manner in which the spin comes out in a natural way as a result 
of setting up the proper relativistic Hamiltonian is one of the major 
triumphs of Dirac’s theory. By contrast, in the Pauli-Darwin theory, 
the spin had to be added as an additional physical assumption. 

53.6. The quantization of the total angular momentum 
As a first illustration of the application of Dirac’s theory we shall 
calculate the quantization of the total angular momentum of a single 
electron. This calculation is an extension, to include the spin of the 
electron, of the method in the non-relativistic theory by which we 
classify the states of the electron in terms of the orbital angular 
momentum. The problem is to determine a wave function T* to repre- 
sent a state in which the square of the total angular momentum and 
one of the components of the total angular momentum are quantized; 
i.e. we wish to determine a wave function T* satisfying the equations 

J2Y = AY, = (52) 

where A and B are constants and J is the operator corresponding to 
the total angular momentum. This is made up of the sum of L, the 
orbital momentum and 8, the spin angular momentum, so that 

<J = L-{-8. 

If, for simplicity, we write 

then J® = (L-f-s)^ = Ij_s^-\~2Lfg8g), 

Now in polar coordinates (cf p. 87) 



We may therefore write the second of equations (62) in the form 

= 0. (62a) 
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Now 


ioLiOL^Y = 



0 0 0\ 
1 0 o\ 
0 -1 ol 
0 0 1 / 


so that equation (62 a) may be written as 




= 0 , 

{ih^+ih+Bj(%X) = 0 . 

Similarly the former of the equations (52) when expressed in terms of 
scalar wave functions takes the form 


(L2+p2^^2.,-^)(T2,T,)+^L,('Fi,'F3) - 0, 
where the operators L^, L^, are defined by the equations (63). An 
inspection of these equations shows that they form two identical sets, 
one containing only the wave-functions T 2 and the other only Tj 
aiid The solution of these differential equations is very similar to 
that of the equation for the determination of the angular function in 
the Schrodinger theory of the hydrogenic atom, and will not be repro- 
duced here. The quantized values of and are found to be 

^ =i(i+l)^^ 

where m = ±|,... and j = f,... and is such that j ^ |m|. 

The solutions of equations (62) for given choices of m and j are readily 
found to be 

where ^) denotes the surface spherical harmonic 

(-«<»»< i). 

The functions f(r), g{r), F(r), and 0(r) are not determined by the 
equations (62). 
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53.6. DarwirCa variation principle 

Darwinf has shown how to derive both the Maxwell and the Dirac 
equations from a variation principle 

SS = S jjj j L dxdydzdt — 0, (66) 

where L is given by the equation 



+ complex conjugate of this expression 


%+Y* %+Y* 'Fj+n Ti) 

c 

^4+1^* %-iY* T.+iT* Ti) 
c 

® t 

The function L is considered for the purposes of the variation to be 
a function of the T”s, T*’s, A^y Ay, Agy and <(), If we vary the function 
S it is readily seen that the Euler-Lagrange equations yield equations 
(36) and their complex conjugates and also the equations 

( 57 ) 

J)^=-4^e(TJT4+T*T,4-'F*'r,+'F4*'r4), (68) 

with two similar equations for Ag, and Ay. Comparing these equations 
with the corresponding equations in the Maxwell-Lorentz theory we 
t C. G. Darwin, Proc. Roy. 8oc. A, 118 , 654 (1928). 
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find that the electromagnetic effect of the electron can be represented 
by a charge density — pe, and a current density — ej, where 

p = (Tf Ti+T* %+Yt Tg+T* Y,), (69) 

further the components of the probability current are given by the 
equations ^ ’I'+'F* Ts+T* 'Fj+'F * \) 

j,= -cm 

provided that the T’s are normahzed according to the rule 

to ensure that the total charge shall be — c. 

With these definitions of p, j it may readily be verified that the 


(60) 


(61) 


continuity equation 


| + divj = 0 


(62) 


is satisfied. 

An interesting interpretation has been given to these results by 
Breit.f From the first of equations (30) we have that 


“1 


T = | 


and hence that 

'F%'F= Yf Tj+Tj'Fjir’Fj'Fj+'FJ'Fi. 

Thus we may write the right-hand side of the first of equations (60) 
as T*(— cai)T'. Similarly the right-hand sides of the second and third 
equations can be written as 'F*(-ca 2 )T, Y*(-co( 3 )T'. Introducing the 

a = (oti.aj.oia) 

we may therefore write equations (60) in the vectorial form 

j = Y*(-ca)T. (63) 

This equation suggests that in the Dirac theory the vector operator 
— ca represents, in some sense, the velocity of the electron. For instance, 
when we set up the wave equation for the relativistic two-body problem 
(p. 336), we replace the scalar product (v‘.v®) occurring in the classical 
Hamiltonian by c*{a^a*); though this substitution is suggested by the 
analysis given above, its justification lies m the fact that it leads to 
results which are in agreement with experiment. 

f G. Breit, Proc, Nat. Acad, Sci. 14 , 663 (1928). 
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The Dirac theory differs from the Schrodinger theory in that in the 
latter there is no operator for the velocity, only for the momentum. 


54. States of negative energy in Dirac’s theory 
54.1. Solution of the Dirac eqmtions for a free particle 
For an electron moving in free space both the vector potential A 
and the scalar potential are taken as zero in the equations (36). 
These equations then reduce to the form 



The functions 

'F* = C'*expi(p.r-jrt) (fc=l,2,3,4), (64) 


which represent an electron moving in free space with momentum 
P == iPvPi^Pz) 6i^®rgy Wi are special solutions of the above 

differential equations if the relations 

(W+7nc^)Ci+cp^C^+c(p^-ip^)C^ = 0 ^ 
(W+rm^)C^-cpj^C^+c(p^+ip^)C^ = 0 
(IT— wic2)(78+cp80i+c(pi— ip2)Ci = 0 
(W-nic^)C^-cp^C2+c(pi+ip2)C^ = 0 , 

are satisfied. In the equations (65), pi, pg? W are real and the 

constants Gjg may be complex. Eliminating the quantities from these 
equations, we obtain as a condition for the existence of non-vanishing 
solutions of type (64) the determinantal equation 


0 Gp8 c(pi-tip^) 

0 W+mc^ ^(Pi+iP2) 

cpa c(pi— tpg) W—mc^ 0 

c(pi+ip2) — cp8 0 IT— me* 

which is easily seen to reduce to the form 

= 0 . 




I 64 STATES OF NEGATIVE ENERGY IN DIRAC’S THEORY 311 

Regarded as an equation for the determination of W this equation has* 
the two double roots 

Tr+ = +C(m2c2+p2)^, W- = (66) 

Only the positive root would have any physical significance in the 
classical theory; in quantum mechanics, as we shall see below, both 
solutions must be considered. 

Since both of the roots are double we may choose two of the 
constants arbitrarily and determine the remaining two from any two' 
equations of the set (66). Suppose we take W to be the root If + and 
choose and Q to be arbitrary complex constants A and By say, then 
from the first and second equations of the set (65) we have the solution 

r - r _ _APi+ipM-p»B 

^ ;W^+nw^ ’ 

C, = A, G,=^B 

The number of electrons per unit volume described by this solution is 

^ 2WHAA*+BB*) 

W++mc^ * 

Similarly if we take W to be W~, Ci = C, Cg = D, we obtain from 
the third and fourth equations of the set (65) the solution 

C = C C ^ D \ 

r _ ,P3^-^<<Pi-iPi)^ r _ (68) 

Os ^ -c , C, - -C j 

which corresponds to 
electrons per unit volume. 

In each case the four scalar wave-functions are obtained from 
equation (64). 

The existence of solutions of the wave equation for negative energies 
gave rise to much speculation on their proper interpretation, since states 
of negative kinetic energy have not been observed. If they were, the 
acceleration of the electron would be in the opposite direction to that 
of the external force producing it. In the classical relativistic theory 
this difficulty does not arise, since, though the energy of a free particle 
in terms of its momentum is given by either of the roots (66), the energy 
of an electron can only change continuously; thus an electron which 
initially has positive energy can never reach a state of negative energy, 
owing to the gap between ±mc^. In the quantum theory, on the other 
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hand, discontinuous energy transitions are common in applications to 
practical problems. The possibility of transitions from a positive to a 
negative energy has been investigated by IQeinf and in more detail 
by Sauter,t who found that transition probabilities between states 
of positive and negative energy are small for the fields occurring in 
ordinary circumstances. Nevertheless, all theoretical attempts com- 
pletely to eliminate such transitions have been unsuccessful. 

The physical interpretation of the states of negative energy is due to 
Dirac, § who formulated the 'hole theory’ to avoid the difficulty of an 
electron making a transition from a positive energy state into one of 
negative energy. The first basic assumption of the Dirac hole theory 
is that, in the physical world as we observe it, almost all of the states 
of negative energy are 'occupied’. In accordance with the Pauli ex- 
clusion principle it is assumed that each state is occupied by only one 
electron. The second assumption is that when all the negative energy 
states are occupied the distribution of negative energy electrons does 
not produce an external field and is consequently not observed. The 
distribution in which all the negative energy states are occupied and 
all the positive energy states are empty may be taken as a zero point 
for the measurement of electric charge, momentum, and energy. 

If we assume that an external field can remove one of the electrons 
with negative energy W = — and momentum p leaving a 'hole’ 
in the distribution of negative energy electrons, then, by the second 
of our two assumptions, we shall observe the hole as a particle with 
electronic rest-mass and with its other physical properties given by the 
equations 

liri, e+ = -e, p+=-p, 


where c denotes the electronic charge. A hole in the distribution of 
negative energy states would thus appear as a particle with positive 
kinetic energy, which is more in agreement with our concepts of the 
nature of fundamental particles. Furthermore, the theory predicts 
that such a particle, though possessing the same mass and numerical 
charge as the electron, will be 'positively charged. The existence of 
such particles was confirmed shortly afterwards by C. D. Anderson, H 
who in the course of some observations of tracks produced in a Wilson 
chamber by cosmic-ray particles in the presence of a strong' magnetic 
field obtained the photograph of a track which could only have been 

t 0. Klein, Zeite./. Phydk, 53, 167 (1929). 

t F. Sauter, ibid. 69, 742 (1931); 73, 647 (1931). 

§ P. A. M. Dirac, Proc. Boy. Soo. 126,^60 (1930) ; Quantum Mechanica (1947); chap. zii. 

U C. D. Anderson, Science, 76, 238 (1932); 77, 432 (1933); PApc. Pec. 43 , 491 (1933). 
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produced by a positively charged particle with the same mass and 
numerical charge as the electron. Subsequent experimental workf has 
confirmed Anderson ^s observations oi 'positrons^ as the new particles are 
called. 

Dirac’s hypothesis that the positrons should be identified with holes 
in the distribution of negative energy states is in agreement with all the 
experimental facts now at our disposal. In the theory of the positron 
we have, however, to consider an infinite number of electrons occupying 
the states of negative energy ranging continuously from —mc^ to —oo. 
Various attempts^ have been made to resolve the mathematical diffi- 
culty of handling this infinity, but none of them can be regarded as 
completely satisfactory, though they have all met with some success 
in certain directions. The theory of holes in its present form, though 
not yet completely satisfactory, appears nevertheless to be correct in 
general outline. 

54.2. The production and annihilation of positrons 

The experimental discovery of the existence of positrons in the 
secondary cosmic rays has led to a search for other sources of positrons. 
Shortly after Anderson’s discovery of the existence of the positron it 
was found that when certain substances were bombarded by high-speed 
ions positrons were produced. For example, by bombarding aluminium 
with a-particles Curie and Joliot§ produced a radioactive isotope 
of phosphorus which then disintegrated into a stable isotope of silicon 
with the emission of a positron. Perhaps the most spectacular discovery 
was that positrons can be produced by the passage of hard y-rays 
through matter. This process can be interpreted on the hole theory in 
a simple manner. If the field of the y-ray causes an electron in a state 
of negative energy (— |ir|,p) to jump to a state of positive energy 
(W'jp')f then the transition results in the creation of a pair consisting 
of an electron of energy W = W' and momentum p' and a positron 
of energy W+= \W\ and momentum — p. This phenomenon is called 
pair production. If the energy of the y-ray is hv, then, since for the 
creation of a pair an energy equal to is necessary to give the 
particles the correct rest energy, it follows that for the process to occur 
we must have > 2mc^, 

t L. Meitner and Philipp, Naturwies, 21, 286 (1933 ) ; J. Chadwick, M. S. Blackett, 
and G. P. 8. Occhialini, Froc. Boy. Soc. A, 144, 236 (1934); Nature, 131, 473 (1933); 
I. Curie and F. Joliot, Comptea Rendus, 196, 1681 (1933); J.de Physique, 4, 494(1933). 

I P. A. M. Dirac, Froc. Camh. Phil. Soc. 30, 160 (1934); W. Heisenberg, Zeita. /. 
Phyaik, 90, 209 (1934); V. Weiaskopf, Kgl. Danak. Vid. Selak. 14, 6 (1934). 

§ I. Curie and F. Joliot, CompUa Bendua, 198, 264, 408, 669 (1934). 
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Substituting the known values of the electronic mass and the constants 
A, c, we find that for pair production to occur the y-ray must have 
energy in excess of 1*02 Mev. and therefore a wave-length less than 
0-012 A. The excess energy hv—2mc^ appears as the kinetic energy of 
the positron and the electron. The cross-section for the creation of a 
positive electron with energy W+ and a negative one with energy 
has been calculated by Bethe and Heitler.t In the case where all the 
energies are very much greater than the rest energy mc^ of the electron, 
but not so great that the screening of the Coulomb field by the outer 
electrons is appreciable, the cross-section for the creation of a positron 
of energy between and W^+dW^ and an electron of energy 
(= hv—2mc^—Wj^) is found to be 




(hv)^ 


L 2W^W. 1 \ 

rise -5)' 


(69) 


where ^ is defined in terms of the nuclear charge Z by the equation 

Zh* 






For the case of complete screening and high energies the corresponding 
formula is 






The total number of pairs created in a process of this type can be 
determined in the cases of negligible and complete screening by inte- 
grating equations (69) and (70) over all possible values of W+. In this 
way we obtain for the cross-sections for pair formation 

^ 27/’ 

, jr/28, 183 2\ 

Corresponding results can be obtained^ by numerical integration in the 
case of energies for which the screening is partial. The theory of pair 
formation developed in this way has been compared with experimental 
data§ and found to be in good agreement with the observed results. 

Pairs can formed in several other ways. They can be formed 
during the passage through matter of a proton or an (x-particle whose 


t H. A. Bethe and W. Heitler, Proo. Boy, Soc. A, 146, 83 (1934); H. A. Bethe, Proc, 
Comb. PhU, Boo, 30, 524 (1934). 

i Loo. oit. and H. B. Hulme and J. C. Jaegor, Proc. Boy. Soc. A, 153, 443 (1936). 
j W, Heitler, Qmnhun Theory of Badiaiion, pp. 201-2 (Oxford, 1944). 



§64 


STATES OF NEGATIVE ENERGY IN DIRAC’S THEORY 


315 


kinetic energy exceeds twice the rest energy of an electron, f or by 
the motion of an electron through the field of a nucleus.J Furthermore, 
an electron pair can be created in the field of a nucleus by a y-ray 
emitted by the nucleus itself, § or by the action of two light quanta 
whose total energy exceeds Finally, if an electron of energy 

greater than collides directly with an electron at rest an electron 
pair can be created. || 

Since a pair consisting of a positron and an electron can be created 
we would expect that they can also be destroyed. In the hole theory 
this would correspond to the transition of an electron from a state of 
positive energy to a hole in the distribution of electrons in negative 
energy states. The energy of the positron and the electron is given 
out in the form of radiation. The most common process of annihilation 
is that in which a positron combines with a free or lightly bound electron 
and their energy is taken away in the form of two photons. If the 
momenta of the colliding particles are equal and opposite the photons 
have equal frequencies, but if the electron is at rest the two photons 
have different frequencies. Since the energies of the colliding particles 
must exceed 2?wc^, a simple calculation shows that the wave-length 
of the annihilation radiation must in the former case be less than 
0-024 A. The cross-section for such a process in which a positron of 
energy Wj, collides with an electron at rest has been calculated by 
Dirac, tt who derives the formula 




7rC» 


mV y+l[ 

where y denotes the ratio W+jmc^, For very high energies this equation 
reduces to the simple form 


Tre* 

ymh'^ 


[log2y-l]. 


(72) 


This theory of two-quanta annihilation has been verified experimentally 
by Klemperer.JJ 

An alternative process is the combination of a positron and an 
electron near a nucleus with the emission of a single photon which acts 
as a carrier for the total energy of the two particles. The case which 
can be calculated most easily is that in which the electron is bound 


t W. Heitler and L. Nordheim, Joum. d. Phys. 5, 449 (1934). 
J Fot references see W. Heitler, op. cit., p. 203. ^ 

§ W. Heitler, op. cit., p. 204. 

|j F. Perrin, Comptes Renduitf 197, 1100, 1302 (1934). 
ft P. A. M. Dirac, Proc. Camb, Phil. 8oc. 26, 361 (1930), 
tt 0. Klemperer, ibid. 30, 347 (1934). 
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in the iC-shell of an atom; the formula for the cross-section of this 
process corresponding to equation (71) is 

(k ~ I La I 2 I 1 ?d!l?_log{y+(y^— 1 )h1 

137%V(y+l)V-l)*L (y2-l)r 

(73) 


where as before the energy of the positron is 
energies this reduces to 

^ _45TC*^r 2 _ 1 Jog 2yl . 


For large 


A comparison of the formulae (72) and (74) shows that the cross- 
section for the annihilation of one photon is always smaller than that 
for the annihilation of two. The maximum value of the ratio 
occurs when y is approximately equal to 10; in lead this gives 

= 0*20. 


54.3. Relativistic theory of the Compton effect 

One of the most striking proofs of the fact that the states of negative 
energy arising from Dirac’s equation must be considered as well as 
those with positive energy is provided by the relativistic theory of the 
Compton effect.f The radiation process considered in the Compton 
effect is the collision of a quantum of light of momentum ko with a 
free electron of rest-mass m which is initially at rest. In calculating 
the transition probability from this initial state to a final state in which 
the electron has a momentum p and the light quantum a momentum k 
we must consider intermediate states in which the energy is not con- 
served though the momentum is. It is found that the only possible 
processes are the following: the electron absorbs the quantum kp and 
acquires the momentum p' = k^ and then emits a quantum k in the 
transition to the final state; alternatively, the electron first emits a 
quantum k changing its momentum to p'" = — k and then absorbs the 
quantum ko in the transition to the final state. 

Now to each of the values p of the momentum of the electron there 
are altogether four states corresponding to the two directions of spin 
and the positive and negative values of the enei'gy. If we take the 
negative energy states as intermediate states as well as the positive 
ones we obtain for the differential cross-section the Klein-Nishina 


formula]; 


dQ^[^^+~2+4co8»0j, 


( 76 ) 


t 0. Klein and Y. Nishina, ZeUs.f, Physik, 52, 863, 869 (1929). 

X W. Heitler, Quantum Theory of RadkUion^ 2nd ed., § 16 (Oxford, 1944). 
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where 0 is the angle between the directions of polarization of k and 
kj and r# denotes the classical electronic radius e*/»ic. If 6 denotes the 
angle between kg and k, the relativistic relation between energy and 
momentum and the conservation of energy and momentum lead to 
the equation 

I. _ , 

»w:*-|-l:gc(l— CO80)^®' 

If the primary radiation is unpolarized and of intensity /g the 
intensity of the scattered radiation at a distance B from the scattering 
electron is then readily shown to be 1(6) dLX where 1(6} is given by the 
formula 

I(6\ = ? /i I /(I— cosfl)* ) 

^ {l+y(l-cose)}^ (i+cos*e){l+y(l -cos 0)}| 

(77) 

and y = ^q/wic. The intensity distribution for a radiation of wave- 
length 0*14 A (y = 0*173) in carbon has been measured by Friedrich 
and Goldhaber,*!’ and it is found that the curve for derived from 
these measurements is, within the limits of the experimental error, in 
exact agreement with that determined from the above equation. 

The total scattering may be obtained from equation (77), by inte- 
grating over and the results compared with the experimental values. 
It is found that the theoretical curve so determined agrees well with 
the experimental values at least for energies up to lOmc^.J On the 
other hand, other relativistic wave equations, such as the Klein-Gordon 
equation, lead to results which are not in agreement with the experi- 
mental measurements. The Klein— Nishina formula may be regarded as 
proved, at least for energies in the range considered, and its exactness 
considered as direct evidence in favour of the Dirac theory of the 
electron, when the negative energy states are treated with the same 
weight as those of positive energy. For, if we entirely disregard the 
intermediate states of negative energy, the formula we obtain for the 
Compton scattering is appreciably different from the Klein-Nishina 
formula and is no longer in agreement with the experimental evidence. 

55. The relationship of Dirac’s equations to previous theories 

The fact that in practical problems two of the scalar wave-functions 
^ (fe = 1, 2, 3, 4) have usually greater magnitude than the remaining 
pair enables us to develop the Dirac wave equations in an approximate 

t W. Friedrich and G. Goldhaber, Zeits.f. Physiht 44, 700 (1927). 
t W. Heitler, loc. oit., p. 160. 
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manner which brings out their relation to the non-relativistic theories 
of Schrodinger, Pauli, and Darwin considered in previous chapters. 

If we wish to find a periodic solution 

of the scalar wave equations (36), then we have to solve the system of 
equations 

(IT— e<^+mc2)xi+c(ni— in2)x4+cn8X3 = 0 > 

(ir-e<^+mc*)x2+c(ni+m2)x8-cn3X4 = 0 
(IT— e^— mc*)x8+c(ni— in2)x2+cn3Xi == 0 
(Pf-e^-~mc2)x4+c(ni+in2)xi-cn3X2 = 0 

Now in the case of positive energy states, if the velocity of the electron 
k small in comparison with the velocity of light, W is approximately 
equal to the rest energy mc^ of the electron; therefore 


c 


Substituting from thk equation into the first two equations of the above 
set we obtain immediately in the case A = 0, 



If we substitute these values of xi and X 2 into the third equation of 
the set (78), we find that X 8 satisfies the equation 

'^’X 8 +^(W"-»»c*-e^)Xs = 0 . (80) 


Now if IT k nearly equal to wic*, the quantity W—rm^ represents the 
kinetic energy W* so that‘X8 ^ a solution of the non-relativistic 
Schrodinger equation 

jv»+^{If'-F)}x=0. . (81) 

The same result holds for X 4 . It is clear from equations (79) that Ixil 
and Ixil are small compared with Ixsl and IX 4 I; thus, in a first approxi- 
mation, they can be neglected in the expressions (59), (60) for the 
charge and current density. We then have approximately 
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These results provide an approximate solution of the Dirac equations 
(36); if T is a solution of the non-relativistic Schrodinger equation (81), 
then the following are approximate solutions of (36): 

Yj = -c^w, 'j; = -CjT, 

w - ^a(ni-tn,)+fin3 _ c,(n,+i n,)-c, n,™ 

* 2»»c ’ * 2mc ’ 

where C^, Cj are arbitrary constants and IIi, IIj, IIj are interpreted in 
the usual way as operators. 

For negative energy states the roles of T4 and % are reversed. 

To show more fully the relation of the Dirac equations to the non- 
relativistic theory of Pauli and Darwin, we introduce the two wave- 
functionsf 




Xi 


jfg == e^mcHinl 


(82) 


The Dirac wave-function T* can then be written in the form 


^ = 1 


= (Xi-Axa)® 


-imcHIh 


(83) 


where A denotes the matrix 


A = ioLi 0i2 = 



(84) 


Differentiating both sides of equation (83) with respect to the time, 


( 86 ) 


Now T satisfies the Dirac equation 

n,+a4 = 0, (86) 



t We consider here the esse of positive wiergy states only ; the analysis is similar for 
negative energy states, cf. E. L. Hill and R. LandshofI, Rev. Mod. Phya. 10, 120 (1938). 
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we may write «! Xi = — Actj xi> 

or, in general, °tkXi = -Aa^^Xx, '^ 1 X 1 = Xv (87) 

where Ic 2, Z. SimUarly 

“*Ax 2= -ffjfcXa, ix^Kxi= -Axi- (88) 

Substituting from equations (85), (87), and (88) into the Dirac equation 
(86) we obtain 

{c ^ + (2»i«*+e^)Xi] + 

+ a|c ^Tljaj.xi+ih^+e(l>x^ = 0 . 

Since the operator A and the unit operator are independent, each of 
the expressions in brackets must vanish. In this way we obtain the 
equations , ^ . 

\in -- + 2mc^+e<tj Xi+c(a . = 0, (89) 

+ e<^j X2+c(a . n)xi = 0, (90) 

where we have written 

(a.n)= icT^n^. 

Now we can write equation (89) symbolically as 

5 ^ (g-n)x2 

'2mc[l+{ih(dlBt)+e<l>}l2mc^y 

showing that is given approximately by the equation 

Substituting from equation (91) into equation (90) we obtain 

where S;= -e^+^(o.n)«-j^(o.n)|i«|+4j(a.n). (92) 

Now from equation (34) 

(o.n)* = n*+-(o.ourlA) = n*+-(«.H),^ 

C C 



§ 65 DIRAC’S EQUATIONS RELATED TO PREVIOUS THEORIES 321 
since by Maxwell s equations H = curl A. Furthermore, 

(»«|+e^)(o.n) = (a.n)|i«|+«^j+t«(o.E) 

and 

(a.HKo.E) = {E.H)+JdivE-t(o.EAH)+ii(B.curlE) 


= (E.H)-i{<i.EAH)- 


11 0H\ 


since 


divE = 0, and curlE = -- — . 

c dt 


Substituting these results into equation (92) we obtain 


ih 




To this degree of approximation we may write (a. 11)72 wi for 
so that we have finally for the Hamiltonian § the equation 




(o.EaH)- 

ih 


4m®c* 


(E.H) 


hi 0H\1 


where denotes the Hamiltonian operator of the non-relativistio 
Sohrodinger equation, viz. 

and fc(a.H)/2mc is the energy produced by the spin of the electron 
in an external field. The term 

is a relativistic correction term. The fourth term 

is the term attributed by Thomasf to the spin orbit coupling. The 
remaining terms have^no counterpart in the non-relativistic theory. 


5072 


t L. H. Thomaa, Phil Mag. 3, 1 (1927). 
Y 
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S6. Solutions of Dirac’s equation 
56.1’. Rectilinear motion of an electron 

We now consider the motion of an electron parallel to the ar-axis in 
a field that is a function of x only. We suppose that the vector potential 
A is zero everywhere and that The wave function may 

be written in the form 

T = (93) 


Substituting this expression into the Dirac equation (9) we obtain 

Written in terms of four scalar functions (/^ = 1>2, 3,4), this 
equation becomes 

.jjdxi I W--V(x) . 

= 9. 

.j{dX2 , W—V(x) . 

^X3-*J>8X2+PsXl-»W«X8 = 9. 

^ Xa+^i>a X»-Pt Xi+'m^Xt = 9 , 

Xi-WtXi+P»X»+”^CXi = 9 . 

K we substitute the expressions 


Xi = Xi = ^1+^i 

into the second and third equations of this set of differential equations, 
we find that they reduce to 

d^t i IF-F(a;)^ k, k( i9*+f»nc\ , _ 

i W-V(x), k, kl. , _ 

0 . 

These equations can be written in the form ’ 


d(f>t i W-V{x), .k,_ 

i£a,b are chosmi according to the equations 

o = — (p,-l-troc)/fc, b == {p 2 -\-itne)lk. 


( 94 ) 
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With these values of a, 6 inserted in the expressions for xv X* 
substitute for the x’s in the first and fourth equations of the original 
set. Taking account of the equation (94) we find that both equations 
reduce; to ^ . jY-r(x)j ’■ 


-t+1. 
dx ‘ ^ 




(96) 


We have thus reduced the problem to that of solving the two first- 
order differential equations (94) and (96). When the functions <l>i and 
^2 are determined from these equations, the scalar wave functions are 
obtained from the relations 


’*2 = ^^2(*)exp^(2)22/+J’»2- 


The case of a uniform field V(x) = ax has been treated by Sauter,t 
that of a simple harmonic oscillator, V(x) = cox^, by Nikolsky,! and 
that in which V(x) is a polynomial of any degree in a;, or in 1/a;, by 
Plesset.§ 


56.2. The energy levels of the hydrogen atom 

The most important success of the Dirac theory, apart from the, 
solution of the problem K)f the spin of the electron, has been the 
derivation of tjxo fine structure of the energy levels of an electron 
moving in the Coulomb field of a single nucleus. Dirac, || using the 
methods of non-commutative algebra, showed that Sommerfeld^s fine 
structure formula for the energy levels of the hydrogen atom was a 
natural consequence of the use of the relativistic wave equation. The 
same result was later derived by Darwinff and by Gordon by more 
familiar methods; we shall follow the latter treatments here. 

The Coulomb field of an electron moving in the field of a nucleus 
of atomic number Z may be derived from the potentials 
A = 0, = Ze^lr. 

f F, Sauter, Zeite.f. Physikt 69, 742 (1931). 

i K. Nikolsky, ibid. 62, 677 (1930). 

8 M. S. Plesset, Phyt, Bev. (2) 41 , 278 (1932). 

11 P. A. M. Dirao, Proc. Boy, Soc. A, 117 , 610 (1928). 

tt C. G. Darwin, ibid. 118 , 664 (1928); W. Gordon, Zeiia.f. Phyaik, 48 , 11 (1928). 
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We substitute from these equations into equations (36); introducing 
polar coordinates r, 6, ^ defined by the equations 

X — rnmdcon<f>, j/ ~ rHin6mnif>, z — rcond, 
we obtain 

0 1 


-•I =«-"[' 




dr 


r dd^ rsin 


j0 

dx 




dr r dd rsmddtft 


]■ 


— = ooBd- — sinfl— . 
dz dr rdd 

We thus obtain for the scalar wave functions the following equations: 

cobB d i d^ 




‘j Ti— tfoe-<^|sin 


r dd rsinfli 


]%■ 

8 


/•* ^ , , 


-ificl COB 6-^-^ Bin. ^ — - 
\ dr rdd 


ih“»' 


“j T 2 — |sin B-^ - 


d . cos^ d 


d 


j)T.+ 


r dB'rBmBd<l>^ 

+ifo: (cos . sin 0 

I dr rdB 






+ 


OOS0 8 




|sin ^ 


r dB r Bind d^j 

--^^(cosfl-|-— sin0-^]'Fi = 0, 
\ dr rdd] ^ 


d , cobB d ^ i 


j)'r.+ 


dr^ r ^i^^rsin^i 

+»^(cos0-^— sin0-|-W, = 0. 

\ 0r rdd} ^ 

In these equations we substitute 

\ = %{r,d.i,)6XT?(-iWtlh) (,t= 1,2,3,4), (96) 

and express the function ^ in terms of F, f, 0, g by equations (64). 
As a result we obtain, after some reduction, the system of differential 


dr 




( 97 ) 
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('+l)7+s('''+T+“‘)' " “ 


The differential equations for the four functions /, g, Fy 0 thus fall 
naturally into two pairs. Consequently for each permissible value of W 
we can find two independent solutions for the T-function by equating 
to zero either of the pairs of functions (/, 0) or {F, g). 

The solution of the pair of equations (98) is similar to that of a pair 
(97), so we shall consider in detail only the solutions of (97) here. 
If we make the substitutions 

5 = € = mc^jWy A = mc(l-'€^)^jhy p = 2Ar, 

fir) = i(l-^)ie-iPpy-y,{p)-Mp)} I 
Qir) il+^)ie-iPrVr(p)+m i’ 
the pair of equations (97) is transformed to 

dA (, 1/ , U ,1 S 








2 


[f we assume the functions /i{p), Aip) to bo given by power series of 
the form „ «, 

A(P) = 1 <^sP‘y Aip) = 2 d,p'>, 

««=o »-0 

then substituting for the functions in equation (100) and equating 
powers of we obtain the recurrence formulae 


«. y+«+- 




for the coefficients d,. Since we are considering only positive powers 
oip we may take 5 = 0, Cj = 0 in equation (101) to obtain the equations 


(’'+ 7 ( 1 ^)®'’ (^+2 ^( 1 -**))**“ 
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The necessary and sufficient condition for non-vanishing solutions of 
the type (99) is therefore 


£ 




-i-4— ^ 
u 


= 0 . 


Regarded as an equation in y this determinantal equation has two 
roots, equal in magnitude but of opposite sign. Discarding the negative 
root since it would lead to wave functions which behave as r“" (w > 1) 
in the neighbourhood of r = 0 and hence give a divergent integral for 
J ITI® dr, we find that the value of y is given by the positive root 

y = (102) 

Since j > 1 and J = Z{e^lhc) = Z/137 < 1, it follows that y is always 
real. 

Eliminating dg from equations (101) we obtain the recurrence relation 

{Y+^)^-U+iW 

n'-l 

If the function fi(p) is to be a polynomial of the type 2 which 

the last term does not vanish, then we must have 


y+w'- 


= 0 . 


Regarded as an equation in e this equation has roots 

e= I y+”' 

V{£*+(y+™’)*} 

Taking the positive root of this equation and defining the principal 
quantum number, n, by the equation 

n = n'+j+i, 


we obtain 


W = mcm 




4 


(103) 


Subtracting the rest mass of the electron and substituting for £ and y 
wo obtain Sommerfeld’s fine structure formula f ' 


W 




LI 


i+f 






For small values of Ze^jhc this formula reduces to the-non-relativistic 


formula 


F' = 


Z*me« 1 
2«* n*' 
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The analytical forms for the function8/i(/)) and/ 2 (p) can be determined 
^sily; it is found that when Tf is given by equation (103) and /a are 
given by the formulae 

A(p) = iF,(l-n',2y+l-,p) 

Up) = -y±!^±^^F,(-n',2y+l-,p) ’ 


n€ 


(105) 


where the hypergeometric function is defined by 

)-Zr(«)r(iS+«)n!i- 

If we substitute for /i, /a from equation (105) into equation (99) and 
transform back to the original variables, it can be verified easily that 

l^l>l/|. (106) 

The solution of the pair of differential equations (98) may be carried 
out in a similar way. If we write 

and proceed as before, we find that the condition (102) and the 
Sommerfeld formula (103) are obtained once more, and that 

Corresponding to equation (106) we then have the relation 

1^1 > \gl ( 106 ) 


56.3. Motion of an electron in a uniform magnetic field'\ 

Consider the motion of an electron moving in a uniform magnetic 
field H in the 2 -direction. Then the potentials are given by 
<^ == 0, A ^ 0), 

so that the Dirac equation (43) reduces to 

where we have written w = e£r/2c. If we assume that the wav^ function 
T* is independent of z and write 


we find that ^ satisfies the equation 

t L. D. Huff, Phyt. Rev. (2), 38, 601 (1931). 
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Substituting x' = a5+ij/2w and writing out the four component equa- 
tions we obtain the system of ordinary differential equations 


*^-f-2uoa:'^,-|- mc)^j = 0, 


Eliminating between the first and last of this set of equations and 
writing ^2 

we obtain the differential equation for a harmonic oscillator 


^+(v+i~iW 4 = 0 , 


(109) 


whose solution may be written (cf. Appendix, § 2) 

where 5 ^,( 1 ) is the Hermite polynomial of order v. 

Similarly if we eliminate ^4 between the first and last equations of 
the set we obtain = ftA-i(f)* The two solutions are not, however, 
independent. Substituting the two expressions for ^1, ^4 into the first 
equation of the set we obtain a relation between a, 6. In this way we 
obtain 

Ux') = aDM = 

as the expressions for the functions and ^4. 

It can be shown in a similar manner that 


U<^') = iC, 

where C is an arbitrary constant. 

■ The current densities in the field are given by / 

y, = == -(l+cV)(^iV4+«?ii). 

jy = -YSY = 


= (l-fcV)oa*D,jD[,_i 


F/c-^ibc 
(&«)* ’ 




where 
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From the properties of the i>-funotionB it follows that jy is negative 
for some values of x\ this is apparently a consequence of the spin, since 
the corresponding expression in the Schrodinger theory turns out to 
be always positive.f 
Using the results (cf. Appendix, § 3 ) 


J A(^) A-i(f) di = J 

0 0 ' ' 

we find that the average a;-coordinate of the current defined by 

jydx 

has the value x — 1/— 2 >>( 27 r)>. ( 110 ) 

4 \a>/ v! ' 

Now the classical value of this quantity is 

X = 7 rr/ 4 , 

where r = cpjeH, The quantum mechanical value reduces to this if we ^ 
use Stirling’s formula 

n\ = (27m)* 

The error in the Stirling formula is of the order Ijv, i.e. 10 -® to 10 "^®; 
so that the deviation of the classical formula from that given by the 
Dirac theory is entirely negligible. 




56.4. The scattering of fast electrons by a nucleus 
If a beam of electrons with velocity comparable to that of light falls 
on a single atomic nucleus, Dirac’s equation of the electron must be 
used to calculate the scattering. We have in fact to find a solution 
Y = (Yi, Yg, Yg, Y 4 ) of this equation which represents an incident plane 
wave and a scattered wave, and thus which has the asymptotic form 

piprlh 

(i=1.2,3.4) ( 111 ) 

for large values of r. The constants Cjj. are not all arbitrary constants; 
it follows from equation (67) that if we take 

= Af == 

where A, are arbitrary complex quantities, then 

Q.-^. 0,-^. 

jp+mc * p+tnc 

t G. E. UUenbeok uid L. A. Young, Pky$. Bm. 36, 1721 (12S0). 


(112), 
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where we have written p for WjC, We shall choose j 4 , B such that 
AA^+BB* = 1. As already noted the scattered current depends on 
and only, so that, if mid ddd^ is the proportion 

of the original beam scattered in a given solid angle, we have 

/(«)= W9,^)12+K(^?,^)12. 

Furthermore, if we take the direction of the spin to mean the direction 
referred to axes with respect to which the electron is at rest, then 

-2 = e^cotix, ( 113 ) 


where x, w are the spherical polar angles of the spin direction. 

Now it can be shown that the general solution of the form (111) is 

«,(<», = Af(e)-Bg(e)e-<4’ 

= Bm-\rAg(e)e*^ 


(114) 


where f(Q) and g(d) are functions of 6 alone and depend on Ibe form 
of the potential function V(r) describing the interaction between the 
electron and the nucleus. Thus 


m = {AA*+BB*){f(e)f*(e)+g{e)g*{e)}+ 

+{f{B)g*(e)-f*(d)g{m^*^<^-^-^B*e% (116) 
so that if J? satisfy the equations (112) and (113) we haye 

m) = (ne) 

where the function D(d) is defined by the relation 

m ^ i{f{mo)-f*mB)} ( 117 ) 

and'X) ^ determine the direction of the spin axis of the incident 
electrons. 

If the incident beam is unpolarized so that the spin axes are pointing 
in all directions, we average over all possible directions of the spin axis 
and obtain 1 = \f(6)\*+\g(0)\*. (118) 

In the general case the function P will depend on the polarization of 
the incident beam, and even if the incident beam is uiipolarized the 
scattered beam will in general possess a polarization. / 

The functions /(ff) and g{d) for scattering by an atomic nucleus with 
inverse square law field 

V(r):=^Ze^lr 

may be determinedt from Gordon’s solution of the Dirac equation for 


t Cf. N. F. Mott, Proc. Boy. Soc. A, 124, 426 (1929). 
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the hydrogen atom (§ 56.2 above) and from the asymptotic expansion 

r m 


F(a;P;z)- 


np-cc) 

r(o<) 


(-z)-“ («(i 3 - 2 a)> 0 ) 
(^(|8-2a) < 0) 


and |arg(— 2)1 < tt, |arg2| < it. For the details of the calculation the 
reader is referred to the original paper; we shall only give the result 
here. The functions f(d) and g(d) are such that 

m = \mn\9{ew 

= ^^(l-^“)|cosec* J^-jS^cosec* je+iroij3^M.|.0(a2)| (119) 

and f 9 *—f*g = -^(l~'i8^)*^*a cosec 01og(cosec|^), (120) 

where m, v, e denote the mass, velocity, and charge of the electron, Z the 
atomic number of the scattering nucleus, and a = Ze^lhc, j8 = v/c, and 
Tq =: e^lmc^. 

The formulae ( 119 ) and (120) determining the total scattering and 
the polarization of the scattered beam are calculated with neglect of 
radiative forces which are appreciable for fast electrons. 

If we let jS->0 in equation ( 119 ) we obtain the non-relativistic 
formula , „ . >3 

Now the number of particles N scattered in the angular range 
(^1,^2) is 


so that substituting for I ( 6 ) from ( 119 ) we obtain the formula 

+ 277 aj 8 (sin sin i^a+cosecj^i— cosec (121) 

which reduces to the well-known result 
\mv^l 

in the non-relativistic approximation j8 -> 0. 
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For large values of Z and small angles, 6, we have approximately 

For large values of Z and large angles there is no simple formula for 
the scattering cross-section. The scattering has, however, been calcu- 
lated for gold at a scattering angle of 90® and for various velocities. 
We can express the scattering at 90® in the two forms 




(122) 

or 


(123) 


where R and 8 are numerical factors which are functions of vjc. The 
values of R, 8 are given in tabular form below: ' 


Table 9 


$ 

0-2 

0-3 

0-4 

0‘5 

0-6 

or 

0-8 

0‘9 

J ‘0 

J2 

M 

12 

1-4 

1-6 1 

1-9 

2-2 

2-6 

30 

3*4 


1-0 

M 

M 

1-2 

1-2 I 

M 

0-9 

0*6 

0 


There have been a number of experimental investigations of scattering 
of fast electrons by nuclei, mostly by the method of the expansion 
chamber. A review of the position in 1938 was given by Champion.f 
For more recent work, see RandelsJ and co-workers. Scattering has 
been investigated in H, N, F, A, Kr, I, Xe, and Hg, for energies usually 
up to about 3 Mev., though for xenon recent investigations have gone up 
to 12 Mev. Except for some early work on mercury, discrepancies with 
theory are seldom as great as 2, and may be due to experimental error. 
However, there is little work on large angle scattering with heavy 
elements. 

56.6. The polarization of electrons by double scattering 
In §53.4 we saw that formally the electron has a magnetic moment 
of magnitude eS/2mc. When the electron is bound in an atom this 
magnetic moment cannot be observed directly; all that can be observed ^ 
is the magi^tic moment of the atom as a whole, or the splitting of the 
spectral lines which we may attribute to the magnetic moment of the 
dectron. It is natural then to inquire if a free electron possesses a 
magnetic moment and if there is any experiment by which we may 

t F, C. ChMupion, Sep, Prog, Phyt, 5, 366 (1838). 
t B. Q. Randds, E. T. Chao, and H. B. Crane, Phye, Sev, 68, 66 (1946). 
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determine this moment. It can readily be seen that a magnetic moment 
ehl2mc can never be observed directly, for instance, by the use of a 
magnetometer.f Suppose, for example, that the free electron whose 
spin is to be observed is at a distance r from a magnetometer. Then 
the magnetic field due to the spin moment is of order 

eS/mcr® 

and the magnetic field due to the motion of the electron is of order 

ev 1 
c 


If the effect of the spin is to be observable it must be greater than 
the uncertainty in the field due to the electronic motion, so that we 
must have > ^ 


— >At;, 
mr 


or 


mAv 


>r, 


(124) 


where Av is the uncertainty in our knowledge of v. Now by the un- 
certainty principle we cannot observe both r and v at the same time; 
the uncertainties of our knowledge of r and v are related by the equation 

ArAv > hjm, (126) 

Substituting from this equation into equation (124) we find 

Ar > r, 

showing that such an experiment will be impossible, since the un- 
certainty in the position of the electron under observation would exceed 
the distance of the electron from the magnetometer. In other words, 
the uncertainty in the field we want to measure would have to be 
greater than the field itself. The only way in which we can hope to 
detect the magnetic moment of a free electron is by obtaining Si, polarized 
beam, that is, a beam in which all the spin axes are pointing in the same 
direction or at any rate more in one direction than another. A Stem- 
Gerlach experiment would appear to be the obvious way of producing 
such a beam of electrons, but here again it can be shown by means of 
the uncertainty principle that this is impossible. It appears certain 
that the magnetic moment of a free electron can never be observed 
in any experiment based on the classical conception of an electron 
magnet. 

The idea that an unpolarized beam of electrons (i.e. one in which the 
spin axes are pointing in all directions at random) scattered from a 

t The argument is due to N. Bohr; of. N. F. Mott and H. S. W. Massey, Th^ry of 
Atsmic CoUiaionaf p. 42 (Oxford, 1933). 
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target might be partly polarized was first put forward by Mott.f This 
polarization could be measured by allowing the scattered beam to fall 
on a second target. By projecting an unpolarized beam of electrons 
on to a foil thin enough to ensure single scattering, and allowing some 
of the scattered electrons to fall on a second thin foil, we should be 



Fig. 66. Experiment for producing and detecting electronic polarization. 

able to detect an asymmetrical scattering ubout the direction in which 
the beam falls on the second target. The theory of such an experiment 
was developed by Mott,f making use of the Dirac wave equations for 
the electron. 

Suppose the initial beam LT^ (cf. Fig. 65) is represented by 
% = T4 = 

then the direction of the spin axis of the scattered beam is determined, 
according to equation (113), by the ratio of the amplitudes of the two 
components of the wave function of the scattered beam which, according 
to equation (114), are 

Ag{e^)+Bm^ 

where is the angle between the incident electron beam and the 
scattered beam. Rotating our axes through an angle B-y so that Ty 
becomes the axis of z we find the beam of electrons is represented by 

% = [{mi)-Bg(By)}coBiBy+{Ag(^^^^ 

% - [{Ag(By)+Bf(By)}cosiBy--{Af^^^^ 

By making use of equation (115) and averaging over all directions of 
the spin aids of the initial beam LTy, we can obtain the number of 
electrons scattered by the second target in a given direction (d 2 ’^ 2 )* 
We are interested p^arily in the asymmetry in the scattering about 
the line TyT^. For prescribed values of By and B^ a straightforward 

t N. F. Mott, Proc, Roy, Soe, A, 124, 426 (1929) ; 135, 429 (1932). 



§ 66 SOLUTIONS OF DIRAC’S EQUATIOi^f 335 


calculation shows that the number of electrons scattered per unit solid 
angle is proportional to i . 

^ 1-8cos^2» (126) 

where 



(127) 


— v/c- 

Fig. 66. Theoretical values for the polarization ratio for gold, 
together with an experimental point. 

By means of these equations and the equations (119) and (120) derived 
for/(0), g{B), we can estimate the order of magnitude of the effect to 
be expected in the double-scattering experiment outlined above. If 
we take both of the angles 61 and 62 to be 90° we have approximately 

= (* 28 ) 

Differentiating the right-hand side of this equation with respect to p 
and equating it to zero, we find that 8 has a maximum when jS = 0-764; 
for this value of the velocity of the incident electrons we obtain ' 



Calculations of 8, without using any approximation, were carried out 
for gold {Z = 79) by Mott (loc. cit.) and repeated by Bartlett and 
Watson,f who have calculated somewhat more accurately the rather 
slowly convergent series involved. Massey and Mohrf have made the 
calculation for the self-consistent field of the gold atom. These results 
are shown in Rg. 66, compared with the experimental ratio obtained 


t J. H. Bartlett and R. E. Watson, Phys. Rev. 56, 612 (1939). 
j H. S. W. Massey and C. B. 0. Mohr, Proe, Boy, 8oe. A, 177, 341 (1941). 
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by Skull, Chase, and Myers.f No asymmetry was obtained for alumi- 
nium, in agreement with theory. 


57. The relativistic wave equation for two electrons 

57.1. Eqmtions of Eddiri^ton and Breit 

Until now we have considered only the relativistic motion of a single 
electron moving in an external field, and no attempt has been made 
to discuss the interactions between two or more electrons. At the time 
of writing no complete relativistic theory has been formulated for 
the discussion of problems in which the interaction of two or more 
elementary particles is taken into account. Such a theory depends on 
a proper treatment of the interaction of a number of charged particles 
with an electromagnetic field. The difficulties encountered in formulat- 
ing a completely satisfactory theory of quantum electrodynamics appear 
at the moment to be so formidable that their resolution may be expected 
only along lines rather different from those so far attempted. For this 
reason quantum electrodynamics will not be discussed in this book;J 
we shall merely indicate the results relating to the interaction between 
two electrons which have been derived from this source. 

The first attempts to set up a relativistic wave equation for two 
electrons, similar to Dirac’s equation for a single electron, were made 
by Eddington§ and 6aunt|l. In the classical theory the energy of 
interaction between two electrons is 


g2 yl y2 

^12 L ^ J 


where v^, v* denote the velocities of the two particles and the distance 
between them. The first term of this expression refers to the electro- 
static energy and the other to the magnetic. Gaunt and Eddington 
replace this energy by 2 


and so obtain the wave equation 


0- [no+ JWn}+c^n|)+(al+aJKc+~(a^a*^^^^ (130] 

where the wave-function T has sixieen componelits which may be 
denoted bjf . (m,n = 1,2,3.4), 


t C. a. Skull, C. T. Chase, and F. E. Myers, Phya. Rw, 63, 29 (1943). 

X For a disoussion, see P. A. M. Dirac, Quantum Electrodynamipai Comm. Dub. Inst, 
Ado. Stud. A, 1 (1943) ; A, 3 (1946). 

$ A. S. Eddington, Proc. Roy, 80 c. A, 132, 358 (1929). 

li J. A. Qaunt, Pm. Roy. 8 oe. A, 123, 613 (1929); Phil. Trans. A, 228, 151 (1929). 
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the suffix m referring to the electron denoted by 1, n to that denoted 
by 2. The matrices a}, a|, are the usual Dirac matrices, but now they 
operate on the first and second suffixes respectively. As before 






(ISI) 


f A^ denoting the potentials of electrons 1 and 2 disregarding their 
interaction, and 


Ho = 

“ ic ’ er’ 


(132) 


Ho = (133) 

It was, however, shown by Breitf that the consequences of equation 
(130) are unreasonable. Making use of the quantum theory of the 
electromagnetic field developed by Pauli and Heisenberg, Tireit has 
investigated the form of the Hamiltonian for two electrons. The wave 
equation corresponding to this Hamiltonian is found to be 

• j^no+aMIi+aMP+(ai+a£j)moC4- 

+ = 0. (134) 

This equation of Breit's is not exact. It does not hold when the 
velocities of the electrons are too great or when the distance between 
them is too large, since the potentials are not retarded. With a slight 
modification, however, this equation has been shown to explain, with 
a fair degree of accuracy, the observed facts of the hyperfine structure 
of the helium spectrum. J 

The Breit equation (134) has been derived by Nikolsky§ using Dirac^s 
theory of quantum electrodynamics. || In this theory the Hamiltonian 
of the system of two electrons is the simple sum of the two Hamiltonians 
for the single electrons. No interaction terms are required in the 
Hamiltonian, the interaction being introduced by a wave-function T 
which is a function not only of the coordinates of the two electrons 
but also of the variables describing the field. The interaction energy 
follows as a consequence of the quantization of the field. 

The Breit equatibn for two electrons is satisfactory only when cor- 
rections of the order (v/c)* to the non-relativistic terms are required; 


t G. Breit, Phya. Rev. (2), 34, 653 (1929). 

t Ibid.; 36, 383 (1930); 39, 616 (1932); 31, 248 (1937); 53, 153 (1938). 
§ K. NikoU^, Fivya. ZeUa, d, Sou^Uunion, 2, 447 (1932-3). 

II P. A. M. Dirao, Proc, Boy. Soe, A, 136, 453 (1932). 
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V denotes the velocity of either, electron. It is fortunate that an equation 
correct to this order should be sufficient for calculating the stationary 
states of two extranuclear electrons and that the present gaps in our 
knowledge of quantum electrodynamics do not make the calculation 
of spectroscopic energies impossible. It appears that the approximation 
to terms of order vjc is sufficient for the rough treatment of energy 
levels while the approximation to {vjc)^ apparently gives a satisfactory 
account of their fine structure. 


57.2. The correspondence method of Holier 

We have noted above that the general theory of the interaction of 
two electrons can only be based on a proper theory of quantum electro- 
dynamics. An important class of problems can, however, be solved in 
a more elementary way. The method is due to MoUer.f 

In these problems one considers two electrons initially in states with 
wave-functions %(r), These are taken to be Dirac functions, and 
thus correspond to definite spin directions. We require to calculate 
the probability that a transition occurs to a state in which the two 
electrons are in states with wave-functions TJ, xy The transition occurs 
under the interaction of the electrons alone. Using non-relativistic 
quantum mechanics it is of course possible to calculate the transition 
probability to any power of the interaction. The relativistic theory 
given here gives an answer to the first power only, and is adequate 
when this is sufficient. 

The method is as follows. First one forms the charge and current 
densities associated with the transition of one of the electrons, in the 
sense of Chapter X, § 46. Using the Dirac theory, these are 


p =z py^e"*^^<^^-fcomplex conjugate, 
j = complex conjugate, 

where v^f denotes the transition frequency (HJ— and 

ift = ceYfaY,. 

One then writes down the scalar and vector potentials of the electro- 
magnetic field that such a transition would produce. These are given, 
as in § 46, by the equations 


(136) 


(136) 




( 137 ) 


t C. Mailer, Zeks.f. Physik, 70, 780 (1931); Ann. der Ph^aiJc, 14, 631 (1932). 
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We take the solution corresponding to retarded potentials (i.e. to an 
outgoing wave). These are 

j dr 

Ir-r'l 

^ J Ir-r'l 

We then allow these potentials to act on the second electron, and 
calculate by the ordinary methods of perturbation theory the probability 
per unit time that the electron will make a transition to a state Xf> 
Owing to the term e-2^%Mn the formulae (137 a) for 0 and A, the 
transition probabihty will be zero unless energy for the whole system 
is conserved. 

In the Hamiltonian for the second electron the perturbing term is 
(cf. equation (19)) 

c 

To calculate the probability that after a time t a transition has taken 
place from the state Xi to state Xf we make use of formula (10) 
of Chapter X. This in our case may be written 

P(p| JJ 

(138) ' 

It is here assumed that all the wave functions are normalized to unity 
and that p(W)dW is the number of states of the final system in the 
energy interval dW, 

This formula cannot be correct, because if we make c -> oo (non- 
relativistic case) it does not tend to the correct non-relativistic result, 
using antisymmetrical wave functions. We must therefore replace it by 

Xe2«'>’/<i'i-^i/<>[T<(ri)xi(r2)-’Fi(rij)xi(ri)]d!TidT2 . (139) 

This expression has the correct symmetry; but it will be appreciated 
that the derivation does not depend on any very consecutive argument. 

57.3, GoUisim between two fast charged prtkks 

The method outlined above was first applied by Mellerf to the 
discussion of the non-radistive collision between two free electrons. 

t Loc, cit. 
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The accuracy of MoUer’s calculations is of the same order as that of 
the first approximation in Bom’s method, so that for fast particles his 
results neglect terms of order e^Jhc in comparison with unity. They 
should, however, be exact to all orders of v/c. 

In his treatment of the problem MoUer refers the scattering to a 
Lorentz frame of reference in which the centre of gravity of the two 
electrons is at rest and their momenta are equal and opposite. The 
angle of scattering, 0*, in this frame is related to the observed angle of 
scattering, 0, by the formula 


0* = COS”^ 


2— (y+3)sin^0 
2+(y— l)sin20' 


(140) 


where y is given in terms of the relative velocity of the two electrons 
before collision by the equation 



For Yf, Xii Xf Roller takes plane waves. He finds that the cross- 
section for scattering between aoigles 0, 0-f ci0 is 




; + 


(y-if 




where we have written, for convenience 


(141) 


jl = COS0*. 


(142) 


This formula, it will be noted, contains no terms in the Planck 
constant. 

It is of some interest to compare Holler’s formula (141) with others 
of a similar nature. For low velocities equation (141) reduces to the 
form 

I (0) (Z0 = 47r |(coseo^0+ sec*0— cosec20 sec*0) — 

— ^ (4 ooseo*0-f 3 seo*0-— 2 oo8ec*0 sec®0— 3 sec^0) | sin 20 d6, 

. ^ ( 143 ) 

Now the corresponding formula for the scattering of jS-particles 
(fic < 1) by electrons is (cf. § 24.4) 


1(6) dd 47r|-~^j {cosec^0-fseo*0 


— coBec®0 sec*0 cos r7}sin 20 dd, 

(144) 

(145) 


where 


U = ^iogcot0. 
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If e^/hc is small enough to allow us to put [/ = 0, while v/c ia small 

enough to be neglected, the two formulae are equivalent. This is the 

if 1 /I ^ ^ , 

1/137 < v/c < 1. 


The scattering of fast jS-rays by electrons up to angles of about 12® 
has been investigated experimentally by Williams and Terrouxf and 
by Williams.! Instead of measuring the angles of scattering they 
measured the energy of the recoil electrons; for v/c ranging from 0-60 
to 0*97 along a track of length 18 metres they observed about seventy 
branches with energies between 7,600 and 40,000 volts. This number 
they compared with the effective cro8s-section§ for loss of energy 
between Q, Q+dQ, namely. 


27r 


^ 


which may be deduced from Moller’s formula for small angles. At the 
higher velocities the observed scattering was roughly twice that pre- 
dicted by this formula, but the ratio diminished as vjc decreased. 

A more direct attempt to verify Moller’s formula (141) has been 
made by Champion, || who from the analysis of about 650 m. of track 
of fast ^-particles in nitrogen, photographed by the expansion method, 
obtained 260 collisions with atomic electrons in which the angle of 
scattering lies between 10® and about 30°. The values of the ratio vjc 
ranged from 0-82 to 0*92. The number of particles scattered into the 
ranges 10®-20°, 20°“30°, > 30° was measured and the corresponding 
theoretical values obtained from Moller’s formula by numerical integra- 
tion. The results are shown in Table 10. 


Table 10 


Angular 

range 

Number scattered 

Observed 

Calculated 

> 30° 

10 

13 

20°-30° 

26 

30 

10°-20° 

214 

230 

Total 

260 

273 


It will be seen that the observed values and those calculated from 
Holler’s formula are in good agreement; it should, however, be noted 

t E. J. Williams and F. R. Terroux, Proc. Roy. 8oc. A, 126, 280 (1929). 
i E. J. Williams, ibid. 130, 328 (1930). 

§ N. Bohr, Phil. Mag. 25, 10 (1913); 30, 58 (1915). 
i| F. C. Champion, Proc. Roy. Soc. A, 137, 688 (1932). 
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that most of the observed scattering angles were less than 30® when the 
contribution of the spin to Holler’s formula is very small. 

The theoretical analysis has been extended by MoUerf to the case 
when a quantum is emitted in the collision of two fast charged particles. 
The cross-section for such a process is calculated first by means of the 
correspondence method of Holler and then by means of the quantum 
electrodynamics of Heisenberg and Pauli, J and Dirac ;§ the same result 
is obtained by both methods. The formula for the case in which one 
of the colliding particles is infinitely heavy corresponds to that derived 
by Bethe and Heitler|| for the radiation emitted in the collision of a 
fast charged particle with an'atomic nucleus. 

The methods of quantum electrodynamics have been employed by 
Hulmeft to investigate the interaction of a particle in the nucleus of an 
atom with an electron in the Jl - shell (a case of some interest in the 
theory of internal conversion of y-rays). Hulma’s results show that for 
bound particles Holler’s correspondence method gives the correct result 
if we take the retarded potentials due to the transition where the first 
particle jumps down, and then consider these as perturbing the second 
particle. ’For free particles, however, we may take either the advanced 
or the retarded potentials, or we may put the results in a symmetrical 
form by taking the mean of the two, without affecting the result. 

57.4. The relativistic theory of the Auyer effect 

If a X-electron is ejected from an atom, an outer electron, o.g. an 
L-electron, can fall into the Z-shell giving up its energy in one of two 
ways: 

(а) By radiation, a process which gives rise to the X-ray emission 
lines. 

(б) By transferring its energy to one of the outer electrons, which is 
thereby ejected from the atom. This process is known as the 
Auger effect. Experimental investigations have been made of 
the ratio between the number of y-quanta and the number of 
Auger electrons. 

In calculating the probability of the Auger transition, one treats the 
process as a transition to a state of equal energy of the system com- 
prised by the two electrons. The formalism is thus exactly that of the 
foregoing section, and one uses equation (139). 
t C. Malter, Proe. Boy. Soc. A, 152, 481 (1935). 

t W. Heisenberg and W, Pauli, Zeits.f, Phyaih, 56, 1 (1929) ; 59, 168 (1929). 

§ P. A. M. Dirac, Proc. Roy. Soc, A, 136, 463 (1932). 
li H. A. Bethe and W. Heitler, ibid. 146, 85 (1934). 
ft H. R. Hulme, ibid. 154, 487 (1936). 
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The probability of the internal conversion of the iT-series radiation 
for gold arising from the interaction of the Lj Lj, Lj and Lj Lm 
shells has been calculated by Massey and Burhopt in this way. Two 
approximations arise in the course of the calculations: 

(i) The problem is treated as one of two electrons only, the perturba- 
tion by these of the wave functions of the other atomic electrons 
being neglected. 

(ii) Screened hydrogen-like wave functions are used for the atomic 
electrons. 

The second approximation is probably quite accurate, since it is 
known that it gives satisfactory results in the discussion of the non- 
relativistic theory of the Auger effect. J The error in the first approxi- 
mation cannot be estimated until the wave functions of electrons in an 
atom ionized in an inner shell are found (say, by the method of the 
self-consistent field). 

Details of the calculation are as follows: 

Suppose that an electron in the L-shell falls into an empty A-level 
and that the energy given up raises a second L-electron to a state of 
the continuous spectrum. If we denote by TJ, 'I/ the Dirac wave func- 
tions t)f the initial and final states of the ejected electron and by Xf 
those of the other electron, we find that if TJ is normalized to represent 
one emitted electron per unit time the number of transitions per unit 
time is given by i , 

In this equation and a are the usual Dirac operators 
defining a, ^ are analogous to equations (137a), viz. 


(146) 
The equations 




a j ^ Q2rrivfi\r 


(147) 


Substituting from equations (147) into equation (146) we obtain for 
the probability per unit time of a transition 




( 148 ) 


where we have written, for convenience, 

j = ex7/»i®X<. 

/ = j' = 

t H. 8. W. Massey and E. H. 8. Burhop, Proc, Roy. Soc, A, 153, 661 (1936). 
t E. H. S. Burhop, ibid. 148, 272 (1935). 
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From these definitions it follows immediately that (pj) are time- 
independent charge density and current vector for the transition, 
and that (p,y) are the same quantities for the L-^co transition 
(oo denoting a state of the continuous spectrum). 



The formula (148) does not take the Pauli exclusion principle into 
account. To do this we introduce the quantity analogous to (148) 
but with the final states interchanged. The possibility of electron 
exchange is then taken into account if we take the total probability 
of occurrence of the double transition per unit time to be 

\Afi-Bfi\K (149) 

The application of formulae (148) and (149) to the calculation of the 
probability of the Auger transitions ij, Lj X, oo; Lj, Ln X, oo; 
Li, Liii has been carried out by Massey and Burhop for gold. 

The detailed calculations show the relativistic effects to be of con- 
siderable importance for gold; the X-series internal conversion coefift- 
cient being increased by a factor of twb. The relative intensity of 
the above .three transitions is given in the non-relativistic approxima- 
tion by the ratio 1 : M4: 2*28, while the relativistic theory gives a ratio 
1:6*5: 5*3. If these. conclusions were tested experimentally we should 
have some insight into the validity, ,not only of Dirac's equation for 
the single electron, but also of its approximate extension to two 
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and that the energy given up raises a second L-electron to a state of 
the continuous spectrum. If we denote by TJ, 'I/ the Dirac wave func- 
tions t)f the initial and final states of the ejected electron and by Xf 
those of the other electron, we find that if TJ is normalized to represent 
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(146) 
The equations 




a j ^ Q2rrivfi\r 


(147) 


Substituting from equations (147) into equation (146) we obtain for 
the probability per unit time of a transition 




( 148 ) 


where we have written, for convenience, 

j = ex7/»i®X<. 

/ = j' = 

t H. 8. W. Massey and E. H. 8. Burhop, Proc, Roy. Soc, A, 153, 661 (1936). 
t E. H. S. Burhop, ibid. 148, 272 (1935). 
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probabilities 6^, and the total probability per unit time of the 

ejection of an electron from any shell is 

b = • 

The internal conversion coefficient is then defined by 

b 

since b+g is the total probability per unit time that the transition takes 
place. The internal conversion coefficients for the various levels are 
defined by 6 6 

etc., and the ratio of the number of observed y-rays to the number of 
X-electrons is 

1-a* 


The choice of the name ‘internal conversion coefficient’ is unfortunate 
since we now regard the emission of electrons as due to direct inter- 
action between the nucleus and the atom rather than to the emission 
and subsequent reabsorption of a y-ray. The relativistic theory includes 
the effects of direct interaction as well as those of y-ray emission and 
reabsorption; in fact, no sharp distinction can be made between them. 

In making the calculations, the unperturbed system is taken to be 
an electron under the influence of a central charge Ze. We then have 
A = 0, Ao = Ze/r. (150) 

To determine the perturbing potentials we must make some assumption 
about the field which is to represent the y-ray emitted by the nucleus. 
Suppose the perturbing field is represented by 

Ao == I 


and suppose that Tq represents the normalized wave function of an 
extranuclear electron for the ground state, and Ty that of a possible 
final state, determined by Dirac’s wave equation (19) with A and 
given by equation (161). Then the probability per unijb time that an 
electron is ejected because of the field (161) is given by 




■e9lo~e(a./)i0)}%(JT 


(162) 


where ^ is normalized so that one electron crosses unit area per unit 
time and the summation is taken over all possible final states. The 
functions %, T, have already been determined (§66.2 above) so that 
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once ^0 and # have been prescribed the problem reduces to that of 
calculating the matrix elements occurring in the series (152). 

The number of possible fields (151) is, of course, infinite; the various 
types considered with the corresponding forms of M, s^^e exhibited 
in Table 11. In this table -B is a constant and q denotes 27rv/c. 

For the dipole radiation the energy radiated by the field is 


9o 




(153) 


so that for dipole radiation and Z-conversion we obtain 

7 " 


(154) 


where Yq is the wave function of the Z-state. Since both and |3l| 
are proportional to B, the unknown constant B is not involved in this 
expression. Taylor and Mottj showed that the number of quanta 
escaping from the atom differs from dt (the number of quanta ejected 
in the absence of the outer electrons) only by a factor of order of 
magnitude 1/137 which may be neglected, and is not (gQ—b)dt as was 
previously supposed. The effect of the extranuclear electrons is thus 
to increase the total nuclear transition probabihty per unit time. 

The evaluation of the matrix elements involved in equation (154) has 
been carried out using the wave-functions TJ, %, which are the exact 
solutions of the relativistic wave equation for an unscreened nuclear 
field of potential Zejr. The possible sources of error are thus: 

(i) the assumption that the dipole, quadripole, or other field used 
is the correct one; 

(ii) the neglect of screening; 

(iii) the use in the integrations (162) and (164) of the dipole and 
quadripole forms for the field of the y-ray at all distances from 
the centre of the atom; this is obviously incorrect within the 
nucleus (r < 10"^^ cm.). 

The second of these errors is probably not important since the energy 
of the ejected electron, even for soft y-rays (^ 150 kV.) is always much 
greater than the difference between the screened and unscreened 
potential energies anywhere in the atom. The third source should not 
introduce an error greater than the small fraction 

radius of nucleus 

K = — — — 7 , ( 166 ) 

wave-length of y-ray 


unless the field within the nucleus is unexpectedly great. 


t H. M. Taylor and N. F. Mott, Proc. Roy, Soc, A, 142 , 216 (1933). 
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Conversion in the K-shelL The results of the calculations for dipole^ 
quadripole, and, magnetic dipole radiation are shown in Fig. 68. 
Magnetic quadripole and octopole radiation give considerably higher 
values. Experimental points are shown on the same diagram. 




Fig. 686. 

Conversion in the L-shelL The internal conversion coefficients for 
quadripole and dipole radiation are identical for very hard radiation 
(hv -> 00 ). For dipole radiation Hulme has shown that in this limit 

• “Li • “in • ®£ni “ 6*7 : 1 : 0*0086 : 0*044 
and that for hv = 7*2 x 10® eV. 

“jr-axi = 7*0:1. , 
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Fiflk ha^ shown that for quadripole radiation this ratio remains 
practically unaltered down to energies of order hv == 2*5 x 10® eV» 

The ratio oL^loiLi estimated by EUisf for a number 'of lines 

and found to be roughly in agreement with the theoretical value 7*0. 
Gray and 0 'Leary { find that for RaD in the ultra-soft region 
(Av = 0*475x10®) 

= 2 - 2 . 

For dipole radiation the theoretical value is 1*8 and for quadripole 2*75. 

The experimental determination of the internal conversion coefficient 
is open to appreciable errors, but the data seem to exhibit the following 
features: 

(i) The experimental values plotted against hv may be fitted to two 
smooth curves. 

(ii) These two curves have the same general form as the theoretical 
curves for dipole and quadripole radiation, but in the soft y-ray 
region (hv < 3. 10®eV.) the theoretical values are more than 50 
per cent, too small. 

A detailed analysis of the experimental results for elements of odd 
mass number shows that for ordinary dipole transitions (AZ = 1) the 
magnetic quadripole terms are weak in comparison with the electric 
dipole terms, the ratio of the intensities being of order /c*, where k is 
defined by equation (155). Even for the hardest y-rays this ratio is 
only about 10“^, so that we should expect transitions for which AZ = 1 
to have almost exactly the theoretical coefficient derived on the assump- 
tion that the field is due to a dipole. In the quadripole transitions 
(AJ = 0) the intensities of the magnetic dipole and electric quadripole 
terms are of the same order, while the magnetic octopole terms are 
negligible. The ratio of the intensity of the magnetic dipole terms to 
the electric quadripole terms varies roughly as the square of the wave- 
length of the y-ray and consequently increases for softer y-rays. Thus 
if the soft y-rays arise from transitions between the various states of 
a multiplet level, then the internal conversion coefficient in the region 
of long wave-lengths approximates to the theoretical magnetic dipole 
internal conversion coefficient curve, and in the regi6n of short wave- 
lengths shoidd fall below this, and lie near to the corresponding 
theoretical curve for electric quadripole radiation. 

t E. Rutherford, J. Chadwick, C. D. EUis, RadicUiona from Radidaotwa Subataruies, 
p. 862 f. 

t A. Gray and A. J. O'Leary, Nature, 123, 568.(1929). 
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58. The Dirac equations in spinor form 

We noted in § 53.6 that although Dirac’s equation for the electron 
is invariant under a Lorentz transformation, it can only be written in 
a tensor form in an artificial way, because of the fact that all the 
representations of the Lorentz group are not included in the tensor 
formalism. The necessary extension of the tensor calculus — ^the spinor 
calculus — ^was given by van der Waerden;f with this calculus the equa- 
tions may be written in an automatically co variant form. 

Any pair of quantities (^ 1 ,^ 2 ) which transform according to the 
formulae (166) 


where the coefficients satisfy the condition 


%2 

^21 ^22 


= 1 , 


(157) 


is caUed a spinor of the first rank and is denoted by (k — 1,2). 
Similarly any two numbers which transform like 

■^1 = ^2 “ ^ 2 i %" l "® 22^2 ( f ^^) 

form a spinor which we shall denote by (r = 1, 2). 

The contravariant spinors of the first rank p, are defined by the 
relations mi t 

= S2> b = "“bl> 

= ijj, = -iji, 

which ensure that the scalar products are invariant. 

From these two types of spinors we can build up spinors of higher 
rank. Thus any four quantities transforming like the products 
iiiv ^ 2 ^ 1 * ^ 2^2 spinor of the second rank which we shall 

denote by Similarly second rank spinors of the type can be 
defined. We may also define ‘mixed’ spinors of the second rank as 
quantities transforming like a product of a | and an rj denoted by 
Analogously we can define spinors of higher rank. The main rules of 
operation of spinor calculus are that it is not necessary to fix the ^ 
relative position of dotted and undotted indices of the same spinor, 
and that the complex conjugate of any spinor equation is obtained by 
replacing all the dotted indices by undotted indices and vice versa. 
Two identities of great use in spinor analysis are 

aj^b^ = ( 159 ) 

«**m«*'+®m**’«*+®***®in = (1®®) 

> t B. van der Waerden, 06tt. Naciht. 1929, 100. See also 0. Laporte and G. E. Uhlen- 
vbeck, Phya. Rev. (2), 37, 1380 (1931). 
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their proof follows at once from the definitions. These rules also hold 
for dotted suffixes. 

We now introduce a spinor operator defined by the equations 


II 

dxi^dx^* 


d 

. a 

. 4 , 

^12 — 

dXi 


►5^ 

II 

d 

8 ' 

dxs 

'dXt 

— 

_ d 

+4- 


Now it is obvious from the transformation equations (41) and (42) 
that the four scalar wave-functions Tg, ^l^ of the Dirac theory 
form two spinors of the first rank which we shall denote by ^ and xi* 
The analogue of the four-vector 1,2, 3, 4) will now be a mixed 

spinor <l>^. It follows immediately that the Dirac equations may be 
written in the spinor formf 

0, (161) 
where we have introduced the abbreviation 


(164) 


= (163) 

I 

Eliminating xi from (161) and (162) we obtain 

= 0 - 

Now by means of the identities (159) and (160) we have 
and from the definition (163) of pf we have 

PcxA'y^'-APaX'y^ = 

% 

where is defined by 

Thus the second-order wave equation (164) reduces po the form 


(165) 


m 

2\7 




(166) 


t The summation convention is used throughout, i.e. when a suffix occurs twice the 
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The left-hand side of this equation is identical with that of the spinor 
form of the Klein-Gordon wave equation. The term on the right-hand 
side represents the spin correction. 

In the spinor notation the Darwin Lagrangian (56) may be written 
in the form 

+2m„c('n X^+% 'Fa+Xa Xx)-i9^9i.^> (m 

where the quantities are defined by equation (165). 

Considering X as a function of T, their derivatives, and 

forming the Euler-Lagrange equations as before, we obtain the Dirac 
equations in spinor form together with the Maxwell equations with the 
Dirac current. In this way we obtain spinor expressions for the current 
and the charge density. 


59. Relativistic wave equations for particles of spin greater than 
a half 

Relativistic wave equations for particles of any spin have been 
derived by Diracf and by Pauli and Fierz,| on the assumption that 
each of the scalar wave functions satisfies, in the absence of interaction, 
the generalized second-order wave equation. The resulting equations 
connect two irreducible spinors but may be split by a suitable trans- 
formatiori into two sets. The simplest case in the Dirac scheme (apart 
from the Dirac equations of the electron) is that for a particle with 
twice the electronic spin. In this case we have for a particle of non- 
zero mass Wq in the absence of an electromagnetic field the spinor 
equations ^ 

( 168 ) 

le 6l ^ ^ 


Th^e equations relating the field quantities J5^, ^4^;^ are analogous to 
the Maxwell-Lorentz equations of the electromagnetic field, an essential 
difference being that all the quantities are here assumed to be complex. 
The transition of these equations from spinor to tensor form has been 
effected by Kemmer§ who showed that there are four possible ways of 
writing the equations (168) in tensor form. They are 



t P. A. M. Dirac, Proc. Boy. Soc. A, 155, 447 (1936). 

X W. Pauli and Fierz, Hek. Phya. Acta, 12, 3, 297 (1939); Proc. Roy. 5foc. A, 173, 211 
(1939). 

§ N. KraEuner, ibid. 166, 173. 
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(170) 



8Xfi ^ 8x 

■O 8y°‘^y ,g 

y ^ = K^r 

Qt-*1 “ 

(171) 




(with <f,ap = 


and finally 






^<l>ayS 1 8^0/58 8<^aj5y _ . 

^ g^a^yS 


dx„ 


^ dxy dxs 

Z dx^ ~ 

a==l ® 






(172) 

with 


^aj3y ~ ^j3ay 

'^otyp- 

(173) 


In all these equations a, j8, y, 8 run from 1 to 4 and 

K = m^clh. 


It is immediately obvious that equations (169) are equivalent to the 
Klein-Gordon equation 




(174) 


Equations (170) are those proposed earlier by Procaf for Bose-Einstein 
particles (i.e. particles of spin one). The analogous equations (171) and 
(172) had not been considered prior to Kemmer’s analysis. From these 
equations Kemmer develops four inequivalent possible forms of field 
theory for Bose-Einstein particles. 

It was pointed out by Kemmerf and Duffin§ that the Proca equations 
(170) could be written in the form 



if then we construct a column matrix Y whose ten elements are the 
components of the vector ^ and the distinct components of the tensor 
Xfufii tike Proca equations assume the form 

(176) 

/l-l 

heie t^e satisfy the relations 

%hioh are analogous to the relations a^a^+a^a^ = 28^^ in the Dirac 
Iheecy. 

t Ph^* Badiuin (vii), 7, 347 (1936)# 

X N. Kernmei;, Proc. Boy. Soe. A, 173, 91 (1939). 

I R. i. DufBa, Phya. Rev. 54, 1114 (1938). 
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The algebra defined by ike equation (176) has been examined in 
detail by Kemmer.t Just as in the case of the algebra defined by the 
Dirac a-operators there are 16 linearly independent quantities among 
the four a’s and their multiple products, in the algebra of the j8-operators 
there are 126 independent elements of the algebra formed from the 
multiple products of the four jS’s and the unit matrix. The jS’s may be 
represented by lOx 10 Hermitian matrices. { 

A question which arises naturally out of these investigations is 
whether the behaviour of elementary particles of any integral or half- 
integral spin can always be described by an equation of the type 

= ( 177 ) 

where the p/^ are the usual differential operators, ^ is a constant, and 
the quantities a* are four matrices describing the spin properties of the 
particle. This problem has been investigated by Bhabha,§ who suc- 
ceeded in shomng that for particles of spin greater than 1 the equations 
of Dirac, Pauli, and Fierz cannot be written in the form (177). Bhabha 
postulates that the fundamental equations of the elementary particles mud 
be first-order equations of the form (111) and that all properties of the particles 
must be derivable from these without the use of any further subsidiary 
conditions. As a result of this assumption it is found that the scalar 
wave functions do not satisfy a second-order wave equation as in the 
theory of Dirac, Pauli, and Fierz, but Bhabha considers his assump- 
tion to be more logical than theirs. He finds that each component of 
the wave function satisfies an equation of order greater than two which 
can be written in the form of products of the operators of the usual 
second-order equation. A consequence of these equations is that we 
must interpret every particle of spin greater than one as possessing not 
one but several values of the rest mass. These states of higher rest 
mass are a necessary feature of the theory. It is shown that these , 
higher values of the rest mass are simple rational multiples of the lowest 
mass. For example, particles of spin f , if they exist in nature at all, 
would be expected to appear with two values of the rest mass, m and 
3m, the former being the stable rest mass. Particles of spin 2 should 
they exist, would appear with rest masses m, 2m. 

t N. Kemmer, Proc. Carnh. Phil. Soc. 39, 189 (1943); see also E. Sohr6dinger, Proc. 
Boy. Irish Acad. 48, A, 136 (1943). 

X N. Kemmer, loc. oit. 

§ H. J. Bhabha, Bw. Mod. Phys. 17, 200 (1946). 
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60. Introduction 

In this book we have employed the methods of wave mechanics as 
developed by SchrOdinger and others. Although these methods are the 
most suitable for the applications of the theory to the simpler problems 
of physics and chemistry they were not the first to be put forward to 
replace the older quantum theory of Bohr and Sommerfeld. The first 
system of equations designed to that end was set up slightly earlier 
by W. Heisenberg and developed by Bom, Jordan, and Dirac. Because 
of the fact that the quantities occurring in this system of equations 
ob^d the laws of matrix algebra, the theory became known as ‘matrix 
mechanics’. This formulation of quantum mechanics is mathematically 
equivalent to wave mechanics, but since it is sometimes more con- 
venient to make use of the results in the matrix form we give here 
a very brief outline of the foundations of matrix mechanics without 
going into the detailed solution of special problems. The relation of the 
Heisenberg equations to the Schr6dinger equation is also indicated. 

The theory of matrix mechanics was developed from the phenomeno- 
logical .point of view; from the beginning Heisenberg adopted the 
principle that only physical quantities which are actually observable 
'Should be incorporated in the new theory. Heisenberg proposed to 
build up a theory solely in terms of observable properties of atomic 
systems such as the frequencies, intensities, and polarizations of the 
radiations emitted by such systems and omitting concepts of doubtful 
observability such as the positions and velocities of electrons. The 
relation of matrix mechanics to classical mechanics is more direct than 
that of the SchrOdinger theory because the transition from the classical 
theory to the quantum theory is made here at an earlier stage. The 
transition from classical dynamics to the 8chr6dinger wave mechanics 
corresponds to a later stage in the development of the classical theory; 
indeed, SchrOdinger ’s method consists in replacing the Hs.milton-Jacobi 
equation of classical theory by a wave equation suitable for the dis- 
cussion of atomic processes. ^ 

We shall Ifiegin by recallin^>SOme of the results of classical mechanics. 

61. The classical Poisseii bracket 

In dassioal mechanics a dynamical system possessing n degrees of 
freedom is completely, specified by the generalized coordinates and 
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the generalized momenta (i = Any other dynamical 

variables of the system, say, F, can be expressed uniquely as 
functions of these generalized coordinates and momenta. From any 
two such d 5 mamical variables F, 0 the expression 





^Pi w 


( 1 ) 


may be formed; such a quantity will be called a classical Poisson bracket 
and be denoted by [F, G\. It follows immediately from the definition 
that _ 






dpi 


( 2 ) 


so that in classical mechanics the differentials with respect to the 
variables g'j, may be replaced by a Poisson bracket. In particular 
we may write Hamilton’s canonical equations 


dH 


dH 


in the form qi+[H, q^] = 0, Pi+[H,Pi] = 0. (4) 

The importance of the Poisson bracket in classical mechanics arises 
from the fact that it is invariant under a contact transformation. It 
follows at once from the form (4) of Hamilton’s canonical equations 
that they too are invariant under a contact transformation. 

The equations (4) may be generalized for any dynamical variable, F, 
which does not contain the time explicitly; thus 

^_^ldF,8F.\ ^IdFdH dFdH\ 

Many of the most interesting and useful properties of the classical 
Poisson brackets follow immediately from the definition (1). For 
example, the equations 

br.Pj - = 0 j (gj 


are readily established. The dynamical variables occurring in classical 
mechanics can usually be expressed as polynomials (or power series) 
in the coordinates q^ and the momenta so that Poisson brackets 
involving them can be calculated from jcules Of the type 

= ■ ( 7 ) 

= (8a) 

Ji, 0] = [F„ G], (86) 

all of which follow from the definition (1). 
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>2. The quantum Poisson bracket 

We shall now investigate the analogue of the Poisson bracket in 
quantum mechanics. To do this we consider some of the properties 
of the classical Poisson bracket to see which of them may reasonably 
be taken over into the formalism of quantum mechanics. It follows 
immediately from (8 a) that 

[FiK G,]G,+GJiF,F„ G,] 

and from (86) that the right-hand side reduces to 

[Ji, G,]F2 G^+m, G,]G,+GIF„ G,]F,+GM, G,]. 

Similarly by applying the theorems in the reverse order we can show 
that 

[i; Ji, 0, 0,] = [ii, 0^\G,F,+OlF,, 0,]F,+F,[F,, 0,]G,+F, G^F,, G,]. 
Equating these two values of the Poisson bracket [FiF^yGiGz] we 
obtain the relation 

F,G,-G,F, F,G,-G,F, 

In classical mechanics this result is trivial since the numerator of each 
fraction vanishes identically; but we have seen that in quantum 
mechanics where the dynamical variables are in general operators, 
FO--GF is not always zero. In such a theory we might then interpret 
equation (9) as meaning that the Poisson bracket [F, G] is proportional 
to the expression FG— GF. The constant of proportionality in a special 
case was first calculated by Diracf, who considered the expression 
FG—GF of the quantum theory and found the corresponding formula 
of the classical theory to which it must tend in the limit for large 
quantum numbers. In the classical theory the F and G are functions 
of the generalized coordinates and momenta of a multiply periodic 
system. Dirac found that /or large quantum numbers FO—GF corre- 
sponded to the expression ih[Fy G] of classical mechanics. Generalizing 
this result he postulated that the equation 

FG-GF=^ih{F,G] (10) 

holds in all circumstances. Equation (10) is the basic axiom of the 
quantum mephanics formulated by Dirac and can hav^ no formal proof. 
Its sole justification is that its use leads to results which are consistent 
with the experimental evidence and that it reduces to the classical 
equation FG = GF (11) 

t F- A. M. Dirac, Proe. Soy. Soe. A, 109, 642 (1926). 
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in the limiting case R -> 0. It is assumed that the Poisson bracket . 
[F, 0] is calculated formally as in classical mechanics and obeys the 
same laws as the classical bracket ; equation (10) then supplies informa- 
tion about the difference between the two products FO, GF of the two 
dynamical variables F and G in the quantum theory. 

The quantum Poisson bracket is defined by the relation 

[F,Cf] = ^(FG-GFy, (12) 

it can easily be verified that it has the following properties 

[F,F] = 0, (13) 

[F,c] — 0 (c, scalar), (14) 

as well as the properties (7) and (8). 

63. Angular momentum relations for a single electron 

To illustrate the method of calculation of Poisson brackets in quan- 
tum mechanics we shall evaluate the brackets corresponding to certain 
of the dynamical variables connected with the motion of a single 
electron. If the electron has coordinates (g^, q^, q^) and linear momenta 
iPvPz^Ps)^ we assume that the relations 

[?i.9y] = = h (hj = 1.2, 3) (16) 

of classical mechanics hold good in quantum theory. 

In classical mechanics we define the angular momentum m of a single 
electron by the vector equation 

m = qxp, 

so that its components are 

(Wi.Wjj.ma) = (qifi-qzMiPi-qiViAift-Wi)- 

From the definition of m it follows that 
K.3i] = 

= [ffl. ?i]^ 2 +?i[> 2 . ?i]-[? 2 . 9l]Pl-9lPl,qil 
naing the rules (7) and (8). If we assume the relations (16), each of the 
Poisson brackets vanishes except [pi,?i] = — [ji.pJ = —1, so that 

K.3i] = ?2- 

Similar calculations lead to the results 

[%.3i] = ® [%>32] = = —?2 

K- 9i] = -?8 K. ?2] = 0 K, 92 ] = 9i 

[»»$.9l] = 92 [»»8.92] = -9l [»»8,98] = 0- 


( 16 ) 
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tyjtilaiig r* = q'+gl+ql and making use of equations (8) and (16) we 
obtain the result 


[Wi.r*] = gi[mi,qi]+[mi,gi]gi+g2[mi,g8]+[mi,g'Jg'4+ 


= 0 , 

giving the relations 


H-gsK-gsl+K-gslg!. 


= [wig.r*] = [ffig.r*] = 0. (17) 


Now it can easily be shown from the definition of a Poisson bracket 
in classical mechanics (equation (1) above) that, if [F, G] = 0 for any 
two dynamical variables G, then — 0, where <^((?) is any 

function of G, If we assume this result to be carried over into the 
quantum theory we have from (17) 

[mi,<t>(r)] = [mj,^(r)] = [ms,<t>(r)] = 0. (18) 


^ Corresponding to the set (16) we have the set 

K.Pl] = 0 [^vPt] = ;'3 [»»l.^>8] = -Pt ' 

= -P» = 0 hi, Pa] = Pi 

hi<P^ = 'Pi = -jJi [Wg.pa] = 0 , 

These identities may be easily proved; for instance, 


(19) 


ha>Pi] = biPt-MvPi] 

= bvPi]P2+9lPvPi]-[Mi]Pi-9lPvPi] 


= Pi- 

By a method similar to that employed in establishing the relations (17) 
we can show that 

hvP*] = = [»»8,p*] = 0, (20) 

and, hence that 

K. x(p)] = K. xip)] = K- x(j>)] = 0. (21) 

where xiv) is a function of p. 

For an electron moving under the action of a central force the 
llamiltonian function, H, may be written in the form 

ff = lp*+F(r), 

2/i 

where ii denotes the mass of the electron. Hence by equations (18) ,and 
(21) we have 


[r»i,£r] = = 0. 


( 22 ) 
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Furthermore, from the definition of the angular momentum ■we have 

K>%] = 

= [w^i. 93 ]J>i+381»»i.J>i1-[TOi, ?i]j)8-3i[wii,353l 
“ ^lP2 ?2Pl “ ^3* 

In this way we establish the relations 

[mi,m 2 ] = m 3 , [m^,m^] = mi, m^] = mg, (23) 

from which we may readily deduce 

[mi, m2] = [m 2 , m2] = [m 3 , m2] = 0. (24) 

We have established the relations (16) to (24) by means of the formal 
properties of Poisson brackets in classical mechanics. It is also of 
interest to note that they can be established simply if we make the 
assumption, implicit in the Schrodinger theory, that 

h d 

i dqi 


We then have 


and so, by equation (12), 


1 


d a\ 


The right-hand side reduces to so that the effect of operating on 
the wave-function Y by [m3, q^] is equivalent to multiplying it by ^2- 
This may be written formally as 

K.3i] = ?2 

in agreement with the relations (16). The other relations can be 
established in a similar way. 

64. The angular momentum of a system of electrons 

If the dynamical system consists of several electrons, then the angular 
momentum of the system is taken to be the vector sum of the angular 
momenta of the individual electrons. Thus, if there are n particles with 
angular momenta m<^^ (i = l,2,...,ri) the total angular momentum of 
the system is defined by the vector equation 
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If i ^ j, then it follows at once from the definitions that 

[9i’.9i’] = 0. = 0 (26) 

for i, j = 1, 2,..., w. Now for any particle i 


W.#] = i K> #] = #] = 

j <=1 


In this way it may be established that, for i = 1, 2,..., n, 


= -g«) 
= # 

The relations 

= 0 

= -pf 


[■Mi.#] = # 
- 0 

[Jf,,#] = 

[•Mi,#] = # 
[J4,#] = 0 


[^1,#]=-#'! 

W,#] = # 

[itf,.#] = 0 J 

W,#] = -#] 

[Jlf3,#]==# 

= 0 j 


(26) 


(27) 


(i = l,2,...,n) corresponding to the set of equations (19) can be 
established in a similar fashion. 


Furthermore, 

[M„M,] = m«)]. 

Now by definition 


SO that 

giving one of the relations of the set 


i=l 


[ifi, Jf J == Jfa, [M,, M,] = Jfi, [M,, M,] = iffg, (28) 

the others being established in the same way. 

* Finally, if M® is defined by 

" M» = Ml+Ml+Ml, 

it can be shown from equations (28) that 

[JK;, M®] = [JMa, M®] = [if3, M®] = 0, (29) 

also that = [i^,M®] = 0 (i = 1, 2, 3). (29a) 

Thus the ang]idar momentum identities hold not only for a single electron 
but also for a system of electrons. ^ 

65. The equations of motion 

We have seen that in classical mechanics the Hamiltonian equations 
of motion (3) may be written, in terms of the classical Poisson brackets, 
in the form (4). We now postulate that in quantum mechanics the 
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equations of motion are of the same form except that now the classical 
Poisson bracket is replaced by the quantum Poisson bracket defined 
by equation (12). In this way we obtain the equations of motion 

= = (36) 

(j = 1, for a system with n degrees of freedom. In addition to 
the equations of motion we require new quantum conditions to replace 
the conditions ^ 

(31) 

of the Bohr theory of the atom. These are obtained by assuming that 
the Poisson brackets involving the coordinates and the momenta p^ 
have the same values as in the classical theory. Combining the classical 
equations (6) with the definition (12) of a quantum Poisson bracket 
we obtain the quantum conditions 

mk-Mi = ^ ' 

PjPi-PkPj = ^ • ( 32 ) 

^jPk^Pk^j — ^^^jk j 

This method of deriving the quantum conditions (32) is due to Dirac;t 
they have also been derived from (31) by a method making use of the 
correspondence principle. ^ 

The variation with time of any dynamical variable which does not 
contain the time explicitly is taken to be defined by the classical 
equation (5) with the Poisson bracket replaced by its quantum analogue 
(12), i.e. j 

/ = [F, H] = ^(FH^HF). (33) 

ti% 

For this reason any dynamical variable, a, which satisfies the equation 

= = 0 (34) 

is called a constant of the motion. For example, equations (22) show 
that the components of the angular momentum of a single electron are 
constants of the motion for a central field of force. The Hamiltonian 
is itself a constant of the motion. 

66. Matrix mechanics 

So far we have not specified the nature of the entities q^^ Pj satisfying 
the commutation relations (32), though it is obvious from the third of 
these equations that they are not ordinary numbers. We have seen 

t P. A. M. Dirac, Proc. Roy. Soc. A, 109, 642 (1926). 
t M. Bom, W. Heisenberg, and P. Jordan, Zeits.f. Phya. 35, 667 (1925). 
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previously, however, J that an equation of this type is satisfied if we 
take 




t 0g, 


and interpret it as meaning that the effect of applying the operator 
qjPj'-jpjqj to a function of the p’s and g’s is merely that of multipl 3 dng 
by ih. We can, however, interpret the quantities q^, in quite a different 
way. In the classical theory the coordinate q in any system varying 
j^riodically with the time with period v can be represented by a Fourier 
series of the type oo 

3=2 3(i)«*"‘'’", 

j S= — 00 

where, to ensure that the coordinate q is real, the coefiicients q(j) must 
satisfy the relations j) = ( 35 ) 

for all values of j. In the quantum theory the classical frequencies jv 
are replaced by the spectral frequencies which satisfy the equations 

= 0 , (37) 

^mn ~ “^nrw* 

In setting up a matrix formulation of quantum mechanics, therefore, 
we form, instead of the Fourier series, the matrix of which the elements 

(m\q\n)e^”^^”*nt, ( 39 ) 

The qi^ntities (ni\q\n) replace the numbers q(j) of the classical theory. 
The^ ^atrii^ element (39) corresponds to the transition of the system 
ftpin the state m to the state n. 

Since = 0 for all n, it follows that the elements along the leading 
^i^gonal of this matrix are constant with regard to the time. Further- 
more, if the matrix is to represent a real coordinate, then corresponding 
to (36) we must have (^|^|^) _ (^|^|^), (^0) 

Matrices with this type of symmetry are called Hermitian\ all the 
coordinate matrices of quantum mechanics are assumed to be of this 
type. The matrix with elements (n\q\m)* is called the Hermitian con- 
jugate of q and is usually denoted by q^, A Hermitian matrix is thus 
onptetisfying the equation ^ ^ 

If we have ^o coordinate matrices 
3i = 

q» = ll(m|32l»)e*"‘’'”"‘ll. 

X p. 36 above. 
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equations of motion are of the same form except that now the classical 
Poisson bracket is replaced by the quantum Poisson bracket defined 
by equation (12). In this way we obtain the equations of motion 

= = (36) 

(j = 1, for a system with n degrees of freedom. In addition to 
the equations of motion we require new quantum conditions to replace 
the conditions ^ 

(31) 

of the Bohr theory of the atom. These are obtained by assuming that 
the Poisson brackets involving the coordinates and the momenta p^ 
have the same values as in the classical theory. Combining the classical 
equations (6) with the definition (12) of a quantum Poisson bracket 
we obtain the quantum conditions 

mk-Mi = ^ ' 

PjPi-PkPj = ^ • ( 32 ) 

^jPk^Pk^j — ^^^jk j 

This method of deriving the quantum conditions (32) is due to Dirac;t 
they have also been derived from (31) by a method making use of the 
correspondence principle. ^ 

The variation with time of any dynamical variable which does not 
contain the time explicitly is taken to be defined by the classical 
equation (5) with the Poisson bracket replaced by its quantum analogue 
(12), i.e. j 

/ = [F, H] = ^(FH^HF). (33) 

ti% 

For this reason any dynamical variable, a, which satisfies the equation 

= = 0 (34) 

is called a constant of the motion. For example, equations (22) show 
that the components of the angular momentum of a single electron are 
constants of the motion for a central field of force. The Hamiltonian 
is itself a constant of the motion. 

66. Matrix mechanics 

So far we have not specified the nature of the entities q^^ Pj satisfying 
the commutation relations (32), though it is obvious from the third of 
these equations that they are not ordinary numbers. We have seen 

t P. A. M. Dirac, Proc. Roy. Soc. A, 109, 642 (1926). 
t M. Bom, W. Heisenberg, and P. Jordan, Zeits.f. Phya. 35, 667 (1925). 
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whicfe is satisfied only if 

(wla|n) = 


67. The solution of special problems 

The solution of any problem in matrix mechanics is complete when 
a set of matrices (m\qj\n) has been constructed to represent the 
dynamical variables qj satisfying the conditions: 

(i) The matrices are of Hermitian type. 

(ii) They are such that the Hamiltonian matrix derived from these 
matrices is diagonal. 

(iii) They satisfy the commutation relations (32), 

The further conditions are summed up in the equations of motion 

= ^{FH-HF) 

%n 

satisfied by any dynamical variable F. 

In the condition (iii) 8^^ is interpreted as the infinite matrix 




is known as the Kronecker delta. 

To illustrate the use of the matrix method we shall discuss two simple 
problems in which the purpose is to obtain matrix representations of 
constants of the motion in diagonal form, thus obtaining their allowed 
values in quantum mechanics. 


(a) Rigid body rotating abovi a fixed axis 

Consider the motion of a rigid body capable of rotating about a fixed 
axis. If the body is free from external forces its angular momentum pq 
and its total energy H remain unaltered during the motion. Since the 
angular momentum is a constant of the motion, we may represent it 
by a diagonal matrix 

If ihe angular displacement of the body is denoted by 8, then the 
configuration of the dynamical systemlis repeated W^henever 8 is in- 
cr^sised by 2n7r, where n is any integer. In the equation 

' ^ = [F.Pe]^'^(P0F-Fpi), 
we may replace F by and obtain 
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or, in matrix form, 

Using the fact that (j\'PQ\h) is of the form (43) we find that the sum 
on the right reduces to a simple multiple of to give the relation 

Pi-Pi = 

showing that = ( j+a)^, 

where j is an integer and a is a constant. To show that a is, in fact, 
zero we need only note that positive and negative values of j in this 
equation relate to opposite senses of rotation, but to the same numerical 
value of the angular momentum; that is, 

(j+<x)h = —(—j+0L)h 
or a = 0. 

Finally Pj=j^> 

a result which can readily be established by the methods of wave 
mechanics. 


(b) The harmonic oscillator 

Suppose that the coordinate of the particle is specified by a matrix q 
with elements 


The Hamiltonian of a harmonic oscillator is of the form 


(44) 


H = ^p‘+iKq^. 

The canonical equations are then 

q = pIp, p = -Kq, (46) 

Eliminating p from these equations, substituting for q from equation 
(44), and writing k ;= 47rV§/[x, we obtain 

This equation is satisfied if we take 

(j\a\kl==0 (k^^jTlh 
= ^ 0 - 

Now from the former of equations (46) we have for the matrix p 

(il2?|i) = (47) 

so that 
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and by the commutation rule (32) this must equal — Substi- 
tuting m = j, making use of equations (46), and of the result 


we obtain 






H:' ■ ■ 

ifn/vi,’ • 


which may be written 




"h 


If we assume that |(l|o|0)l* is the first non-vanishing term of the series 
we obtain 

l(jla|i-l)P ^ 


3^ 


4^/iVo’ 

so that the matrix q has elements 
(j\q\k) = 0 

^ . where the j8’s are arbitrary. 


jg-27ri(vo^+ftj+i)^ 


The elements of the matrix p are then deduced from equation (47). 
For the energy levels of the system we have 


= •^V 1 (•'L+»^)l(w|a|»)|* 

m 

= 47TVo{l(^-l|«WIH|(w+l|a|n)|2} 

== (W' + i)^>'o* 


68. Canonical transformations 

Suppose a dynamical system is completely specified by a set of 
coordinates qj and momenta which may be represented by Hermitian 
matrices. If now tlie coordinates are transformed to Qj and the 
momenta to in such a way that the matrices of the new coordinates 
and momenta are related to those of the old by the matrix equations 

Qi^Sq^T, P^=^Sp,T, 
then [g^, PJ = 8{q^ TSp^^pj, T8q^)T. 

Now the old coprdinates and momenta satisfy the commutation 
relations <32); it follows from this last equation that the new coordinates 
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will satisfy relations of this type if the matrices S, T satisfy the 
equations gf ^ fg ^ 

i.e. if r is the inverse, of S. Thus the transformation 

Qi = 8qi8-\ Pi^8p^8-^ (48) 

preserves, ^h6 forln of the commutation relations (32) and is called a 
canonical transformation^ it plays a role in the quantum theory ana- 
logous to that of a contact transformation in classical mechanics. 

The matrix 8 must also be of such a nature that the coordinate Qj 
is real. Writing out equation (48) we have 

[m\Qj\n) = (mlSlfc)(i|?ll)(ll-S-»|n), 

(nlQ^lm)* = g (n\8mi\qmh\8-^\m)*. 

The two are equal if and only if 

((|S-i|w) = (wlS|l)*, 

i.e. if S-i == -St 

or 88^ = 1. (49) 

For the coordinates to be real the matrix 8 of the transformation 
(48) has therefore to be a unitary matrix. 

Furthermore, since 

PJ = P^P^ = 8pi8-^8pi8 = 8pJ8-\ 

Pf = Pj P? = 8pf8-^8pJ8-^ = 8p^8-\ etc. 
it follows by induction that if F{p,q) is a dynamical variable which is 
a function of pp q^ which can be expanded in powers of pp qp then 
in the new system of coordinates 

F(P,Q) = 8F(p,q)8-K 

and in particular the Hamiltonian transforms according to the rule 
H{P,Q) = 8H(p,q)8-\ 

69. Perturbation theory in matrix mechanics 
A perturbation theory can be developed in matrix mechanics just 
as in wave mechanics, and it is often found in applications to particular 
problems that the matrix method gives the required result more quicUy 
or in a more elegant form. Suppose that the Hamiltonian function 
of the d 3 ma)nical system depends on the value of a parameter « whose 
deviation fi^bm zero denotes the amount of the pertmbation, and that 
this function can be expanded in ascending powers of t in the form 
' H(p,q) = H^P,q)+ 1 (50) 

« n“l 
Bb 


6072 
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The problem to be solved in matrix mechanics is the detenhination of 
matrices p, q which satisfy the commutation relations (32) and which 
are such that H(p,q)^W, (61) 

where If is a diagonal matrix. We suppose that the motion of the 
unperturbed system (c = 0) has been solved; i.e. we have determined 
matrices po» % satisfying the commutation relations and such that 

^o(2’o.3o) = ^0. (62) 


where WJ is a (known) diagonal matrix. 

We shall confine ourselves to a discussion of the case in which 
the dynamical system is uniquely specified by a single coordinate q, 
though the analysis may readily* be extended to systems with more 
degrees of freedom. We assume that the matrix W can be expanded 

in the form F = iro+ J (63) 

»=1 

reducing to Wq when e = 0. 

Transform the variables po, q^ to p, q by means of the equations 


p = 8 pq 8 -\ g = 

where iS is a unitary matrix. Then, since Pq, q^ satisfy the commutation 
relations (32), so also do p, q. Furthermore, 

H(p, q) = 8H(p^, qo)8-^y (54) 

so that by equation (61), ' ^ 

8H(po,qo)=^W8. (56) 

Writing 8 = 8^+ 2 (^6) 


and substituting from (60), (63), and (66) into equation (66) and 
equating powers of €, we have 

SoW,^W,8o = 0\ 

(SiFo-Fo Si)-(Fi S,-S,H,) = 0. ,5,) 

• 

remembering that Hq 

From the first of these equations we have ^ 

^ {(m|Solfe)(il Woln)-(mlWo|fc)(fclSoW} = 0, 

which reduces to WJ„j) = 0, 

si^ is a diagonal matrix. If the unpertmbed system is ncm- 
degmeratey ^ unless m = n; thus (mlSo|n) is^ zero \i 
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80 that ^ is,a diagonal matrix. To determine the diagonal elements 
we must now make use of the fact that S is unitary, i.e. 

whence 


SoSl=l, 


8, Sl+8, 81 = 0 , 8„ 81+8, 81+8^ 81 = 0 . 


( 68 ) 

From the first of the equations (58) we see that is of the form 

which can be absorbed into the usual (unwritten) time factor. We 


thus may write 


(m|/9o|n) = 


(69) 


Substituting this value for (m|iSfo|w) in the second of equations (57) we 
{m\8,\n)(Wo„-W^) = (mmn)-(m\H,\ri), (60) 

from which it follows that is not a diagonal matrix and also that 

(n\W^\n) = (n\H^\n). (61) 

Also from equation (60) we have iovm 


• ^0m“ ^On 


(62) 


To find the diagonal elements (w|)Silii) we make use of the second 
of equations (58); since Sq is a unit matrix this reduces to 

Si+Sl = 0, 

giving (w|/8'i|7i)+(w|/Sf |7i) = 0, 

i.e. = 0. 


From the third of equations (67) we have on substituting for the 
matrix elements of Sq, Hq 

(mjWJIn) = {m\H2\n)+ (m|/8i|A;)(A;|^i|7i)+(m|l^i|m)(m|^iln)+ 

+ (»»l'Ssl»)(lFcm-^D- 

Putting m = n and substituting from equation (62) for (m\Si\k) we 
obtain an expression for (n|B^|ii). Writing " 

K = IFo^+€(w|WiW + €2(7llir2|7l) + 0(€3) 

we have finally 

= lfo„+£(n|J?i|»)+e»[(w|.ff,ln)+ J 

( 63 ) 

wMoh is of the same form as equation (25) of Chapter HI. 
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70. Dirac’s generalization of matrix mechanics 

It is -well biovm ftiat eigewaluea of a certain operator may oonto 
a continuous sequence of values as well as the discrete ones; for 
example, in the case of the energy this accounts for the existence ot 
the continuous spectrum. To account for phenomena of this type m 
matrix mechanics we must extend our matrix calculus to include 
systems in which the elements of the matrices will have indices which 
are continuous in one region and assume only certain discrete values 
in another. The methods developed by Diract to deal with this problem 
will now be illustrated in a simple case— that in which the dynamical 
system possesses one degree of freedom; the extension to systems with 
more degrees of freedom is purely formal. 

The first point to be observed is that in any region in which the 
indices vary in a continuous fashion the elements of a matrix 
are each functions of the two variables and ^ 2 * region the 

matrix law of multiplication becomes 

the integration being taken over all the values of in the continuous 
range. In the case of mixed ranges the product will he the sum of an 
integral and a sum over discrete values. In the continuous case the 
place of the unit matrix is taken by the Dirac 8-function which is 
defined by the following relations 

*' CO 

8(a;) = 0 (z # 0), | Hx)dx = 1. (66) 

— 00 

From this definition the following properties may be derived directly 
or as a result of ail integration by parts 

' =/(“)> (®®) 

— eo 

=/'(«). ( 67 ) 

— 00 

— 00 

From equations (64) and (66) it follows at once that 

(fll/lfa) = J 8(fi— r 

t P. A. M, Dirac, Proc, Roy. Soc. A, 113, 621 (1927). 


( 68 ) 
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which shows that in the continuous range the elements of the unit 
matrix are 

Similarly the elements of the diagonal matrix 

/I 0 0 . . . 

0 f 0 . . . 

lo 0 r . . . 

are f )• 

If we write — f'), 

% 

then it follows immediately from the definition (64) that 

(ficxjs-j8c<ii') =1 J {^8(f-r)8'(f-f)-r8'(^-r)8(r-f)}rfr- 

Integrating the second term by parts and the first by the rule (68) 
we have 

m-Mi') = -jj = im-a 

which may be written in the form 

aj3— jSa = ih 

to show that a and jS are canonical conjugates. 

The fundamental problem of the generalized matrix mechanics is the 
discovery of a transformation of the coordinate system which leads to 
a diagonal matrix for the Hamiltonian of the system. Suppose, for 
instance, that we have a representation in which the generalized 
coordinates and momenta are of the form 

Suppose we change the coordinates and momenta to ) und 

(’ll •PI’}') * transformation 

• q = 8QS-\ p = SPS--^ 

to malfB the Hamiltonian function H a diagonal matrix, say H^8(i}— >)'). 
Then, by (54), ^ q) 

that is, /* 

I (i\H{p,q)\i')di'(e\8\v) = / (^|S1’?')«^’?'(’}'|P|’}) 

= J mri')dr,'W,8(r,-n'). 
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The integral on the right-hand side can be evaluated immediately to 

I mif)df(mv) = -(69) 

Equation (69) is Dirac’s generalized form of the Schrbdinger equa- 
tion.! It assumes a simpler form when the Hamiltonian is an 

algebraic function of the variables p, q; for from equations for p, q and 
the results (66) and (67) 




from which it follows by induction that, iif(q,p) is an algebraic function 

r I h d\ 

I {ii/if)dr(f (70) 


Applying this result to equation (69), writing r) = nand(f |/Sf|7^) = 
we have 

■ " -»r„to = o, (71) 


“m)- 


which is the usual Schrodinger equation. The elements of the trans- 
formation matrix /S', which leads to a representation in which the 
Hamiltonian H is a diagonal matrix, are the eigenfimctions /S^(^^). 

The elements of the matrix representing any other dynamical variable 
F are given by the formula 


(ijlflV) = J (v\8-^W)de f (f |J’(g,i>)in<if (f l^h') 


by equation (70), provided, of course, that ^ is an algebraic function 
of the p and g. Now from the condition (49) we have 

so that writing ?; = n, (i\S\ri) = 8^{^) we have 

(nlfln') = J 


t P. A. M. Dirac, Quantum Mechanics^ 2nd ed., p. 116. 



APPENDIX 

PROPERTIES OF ORTHOGONAL FUNCTIONS 

1. Introduction 

A SET of complex functions of the real variable x is said to 
be orthogonal in the interval a < * < j8, if all the functions of the set 
are such that the integral 

J 

OL 

exists for all values of n, and if 

J T* (a:)T„(a:)(fa: = 0 (m 

a 

The quantities a and j8 may be zero, finite, or ±oo. The simplest 
functions of this type defined in the interval — tt ^ a; ^ tt are cosna:, 
sinna;, where n is an integer. The most important property of ortho- 
gonal functions is that under certain conditions we may expand any 
arbitrary function /(a:) as a series of the type 

f{x) = 2 

n 

analogous to Fourier series. Using the orthogonal property it follows 
immediately that the coefficients are given by the formula 

f f(x)Y*(x)dx 


jY„(xm(x)dx 

OC 

If, for all values of n, ^ 

J IY„(x)l^dx=l, 

a 

he functions T^(a;) are said to be normalized; in this case the formula 
3r the coefficients a„ becomes 

jf{x)'^{x)dx. 

a 

Orthogonal polynomials are of great importance in quantum me- 
hanicB. The theory of such polynomials has been treated in great detail 
y SzegO.f No such rigorous treatment will be attempted in this 

t G. Szegd, Orthogonal PolynomicUs (American Math. Soc. Coll. Publ. XXIII, New 
^ork, 1939). 
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appendix; we shall confine ourselves to the derivation of such properties 
of orthogonal pol3moniial8 as are useful in the solution of certain 
elementary problems of wave mechanics. 

2. Hermite polynomials and functions 

In quantum mechanics the Hermite polynomials arise m the solution 
of the Schrodinger equation for the linear harmonic oscillator. It was 
shown in § 1 1 that, in this instance, the wave equation could be reduced 


to the form 


^+(A-y*)'F = 0. 


dy^ 


( 1 ) 


We have to determine the values of the parameter A which lead to 
solutions which tend to zero exponentially as y tends to infinity. The 
solution was obtained by making the substitution 

( 2 ) 

and it was shown that if A = 2n+ 1, (3) 

then H(y) is a pol3niomial of the form 

(4) 

8=0 

Substituting from equation (4) into the equation 

__2y_+2n^ = 0 (6) 

obtained from p. 51 by putting A = 2/1+1, we obtain the recurrence 

- 2(«-n-2)^* 

for the coefficients of y. Taking to be 2", we obtain the solution 


(6) 


The polynomial defined by equation (6) is called a Hermite polyrmnial, 
and the corresponding function 

\{y) = e-iv'HM 

is called a Hermite functim, 

From the form (6) of the Hermite polynomial^it is readily verified 
directly that 


HM = 


(7) 


Differentiating both sides of this equation with resj^ct to ^ we obtain 
the recurrence relation 

= 2yH^(y)^H^^M 


( 8 ) 
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A second differentiation yields 

WM = ( 9 ) 

Now HJy) is a solution of equation (6), so that 

H’M = 2yK(y)-2nHM (10) 

An expression for NJ,+i(y) in terms of and H^+i{y) can be 

obtained from equation (8) by replacing n by »+l! if is 

K+i(y) = (n) 

Eliminating Hl(y), H'^iy), and Hn+i(y) from equations (8) to (11), we 
obtain the recurrence formula 

^yHn^iiy) = H„^z(y)+^in+mAy)- (i2) 


The first two Hermite polynomials can be determined readily from 
equation (7); the remaining ones can then be found by repeated use 
of equation (12). In this way we find for the first eleven polynomials: 

m = 1, 

Hiiy) = 2y, 

Hiiy) = V— 2, 

H»{y) = 82/’-12y, 

Hi(y) = 16y*-48y«+12, 

HM = 32y'>_16(y+120y, 

HM = 64y«-480/+720y2-120, 

Hj(y) = 128y’-1344y»+3360y3-1680t/, 

if,(y) = 266yS-3584y«+13440y‘-13440!/®+1680, 

H^iy) = 512y*-9216y’+48384y5-80640!/»+30240y, 

= 1024yi"-23040yS+161280y'-403200y‘+302400y*-30240. 

3. Integrals involving Hermite functions 
To determine the normalization factors of the eigenfunction of the 
linear harmonic oscillator we require to know the value of the integral 

00 00 

I(m,n) = J \{yWJy)dy = f (13) 

— 00 “OO 

The proof that I(m,n) vanishes unless m = n follows at once from the 
fact that the function ^^(y) satisfies the differential equation 


y*Y„+(2n+l)'r„ = 0, 
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SO that 


00 00 


which is zero unless m — n. 

In particular, /(w,n+2) = 0, so that from equation (12) it follows 
that ^ 

^ = ^[n+\)I(n,n). (14) 

— 00 

Now the left-hand side of this equation may be written 

' —00 —00 
which gives the recurrence formula 

7(n+l,ii+l) = 2(n+l)/(n,n). 

Now 


7(0,0) = J e~y*dy = Vtt, 


so that by induction we arrive at the formula 
I{n,n) = 2'^n\ Vtt. 

Thus the normalized eigenfunctions of the Schrodinger equation 
dW , 1 awi/* a 


(16) 


are 

where 


%{x) = 




.e-»«-^*J7,((xa;), 


(16) 


2^«(w!)M 
a = {^m'pjW)^ 

The matrix element {n\x\m) for the harmonic oscillator is, by defini 
tion, given by the equation 

00 00 

— po —CO 

Making us^ of the recurrence formula (12) we find that 
00 

J ye-v'H^(i/)HJy)dy = ^I{n,m+l)+nl(m,n-l), 

— CP 

and hence by use of the orthogonality relation that 

(n|a;lw) = 0 
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unless m = n±l. Putting m = w+1 and inserting the value of 
I(n+l,?i+l) as given by equation (15), we have finally 




4. Laguerre polynomials 

Closely related to the Hermite poljmomials are the Laguerre poly- 
nomials which may be defined by the equation 

A comparison of equations (7) and (18) shows that the Hermite poly- 
nomials can be reduced to Laguerre polynomials with parameters 
71 = Conversely, Laguerre polynomials can, to a certain extent, 
be reduced to Hermite polynomials, by the relationf 

= wiSjT I (“ > -«■ 

(19) 

The Laguerre polynomials are most easily calculated directly from the 
definition (18); the first ten polynomials are found to be: 

m) = 1. 

m = 1 -^ m = - 1 , 

L»(^) = 2-4|+f , m = -4+2i, m = 2, 
mi) = 6-18^+9f-^^ mi) = -18+18f-3P, 

Lid) = 18-6^^, Lid) = -6. 

It also follows directly from equation (18) that L*(f) is a solution of 
the ordinary differential equation 

Now if we write aJ = g == 


we find that the equation 




g = 0 


( 21 ) 


dx^ dx x* 

occurring in the theory of the hydrogen atom (equation (18) of Chapter 
II) is transformed to 


f^+[2(l+l)-f]^+(i«-l-l)G = 0, 

t A. W. Uspensky, Arm, Maih. (2), 28, 693 (1927). 


( 22 ) 
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which is identical with equation (20) if we choose 
n==2i+l, fc = 2Z+5, . 

where ^ is a positive integer and a = 2(^+5). Thus the solution of 
equation (21) may be written in the form 

g = (23) 

00 

The integral il,n-.kx = / 

0 

is of importance in wave mechanics; its value is given by the following 
formula due to Schrodinger 

= (-1)*+*'^! fc'! X 

X V (24) 

^^3\(k-n-8)\(p-k+8)\(k'-n'-s)\(p'-k'+s)V ' ' 

where a is the smaller of the two integers h—n, k'—n\ 

In deternuning the normalization factors for the eigenfunctions of 
the hydrogen atom we require the value of the integral 

0 

In this instance k—n = k'—n* = 5—1 and all the terms in the sum on 
the right-hand side of equation (24) vanish except those for which s 
takes on the values 5—1, 5—2, Equation (24) then yields for the value 
of this integral 

J . (26) 

. ' 0 

5« The ^gendre polynomials 

An equation Of great importance in mathematical physics is the 
Legendre equation 

; = ( 26 ) 

in which I is a constant (but not necessarily an integer). If we wish to 
obtain polynomial solutions of this equation of the form 

y = (a»#0), (27) 

fl -0 

then substituting from equation (27.) into equation" (26) and equating 
to z^ro the coefficient of (jl^ we obtain the relation 

a„[n(n+l)-l(l+l)] = 0. 


( 28 ) 
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Since by hypothesis % ^ 0, it follows that polynomial solutions oi‘ 
equation (26) of the type (27) are only possible if Hs a positive integer. 
Substituting from (27) into (26) and equating coefficients of jn® we obtain 
the recurrence formula 



(g— ?)(g-[-Z-fl) 
(s+l)(s+2) ' 


(29) 


for the coefficients whence we derive the polynomial solution 


PnW = y 


s =»0 


(-l)»(2ri-2.)! . 

2"s!(ii— 5)!(n— 25)!^ 


(30) 


where the integer a is Jn or |(n— 1) according as n is an even or an 
odd integer. P„(jLt) is called the Legendre polynomial of degree n. 
From this definition of the Legendre polynomial we deduce the 
Rodrigues formula 


PnW = 2 

8 = 0 


2»s!(»— s)! dfi”’ ^ 


) 


I d- f i-lfnl 
2^n\ dfjL^ ^ s! (n— 5) ! ^ 




(31) 


6. Integral expression for the Legendre polynomial 

If we assume Cauchy’s theorem in the form 


where /(J) is an analytical function of the complex variable in a certain 
domain B which includes the point C = fi, and where the integral is 
taken along a closed oontout G which includes S = /x and lies wholly 
within the domain R, then by differentiating both sides of the equation 
I times with respect to fi we obtain the result 


dfjJ' 




Substituting the function (£*— 1)^ for /(C) we obtain, by an application 
of Rodrigues* formula, 

= ( 32 , 
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Equation (32) is known as Schlaefli's formula for the Legendre poly- 
nomial. Using the identity 

and noting that if the second term on the right-hand side of this identity 
is integrated round a contour of the type specified at the beginning of 
this section it gives a zero contribution, we arrive at the relation 

Differentiating both sides of this equation with respect to [jl we obtain 
the recurrence formula 

p;+,(^)-~/.p;(,x) = {?-fi)i^(^). (33) 

In a similar fashion we can establish that 

(Z-f-l)P^+i(/i)— (2Z-fl)/ii5W+^-^z-i(/^) ~ 

Differentiating this last equation with respect to /i and substituting for 
Pi+i(fi) from equation (33), we obtain the relation 

/xp;(^)-p;„,(^) = iPM. (35) 

Elimination of /xPJ(/x) from equations (33) and (35) yields the result 

(36) 

and finally the elimination of PJ_i from (35) by means of equation 
(33) with I— I written for I gives 

(^^-l)P[(fi) = IfiPM-lPM- ( 37 ) 

Now the expressions for Po(ft) and Pi(ft) can be derived immediately 
from either of equations (30) and (31); they are found to be 1 and 
respectively. The expressions for PJ^ix) (n ^ 2) then follow by repeated 
applications of the recurrence formula (34). In this way we obtain for 
the first six Legendre polynomials; 

. io(M) = A(f^) = 4(6/^*“-3/x), 

Pi(/i) = /i, P,(l^) = i(35,x^~^0/x2+3), 

W - i(3/^^-l), P,(fi) = i(63|u;-70,x8+15/x). 

7. The associated Legendre polynomials 
The equation 

(i-»*5g-v|+[l('+i)-j^]»-o, (38) 

which reduces to the Legendre equation (26) when w = 0, also plays 
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Since by hypothesis % ^ 0, it follows that polynomial solutions oi‘ 
equation (26) of the type (27) are only possible if Hs a positive integer. 
Substituting from (27) into (26) and equating coefficients of jn® we obtain 
the recurrence formula 



(g— ?)(g-[-Z-fl) 
(s+l)(s+2) ' 


(29) 


for the coefficients whence we derive the polynomial solution 


PnW = y 


s =»0 


(-l)»(2ri-2.)! . 

2"s!(ii— 5)!(n— 25)!^ 


(30) 


where the integer a is Jn or |(n— 1) according as n is an even or an 
odd integer. P„(jLt) is called the Legendre polynomial of degree n. 
From this definition of the Legendre polynomial we deduce the 
Rodrigues formula 


PnW = 2 

8 = 0 


2»s!(»— s)! dfi”’ ^ 


) 


I d- f i-lfnl 
2^n\ dfjL^ ^ s! (n— 5) ! ^ 




(31) 


6. Integral expression for the Legendre polynomial 

If we assume Cauchy’s theorem in the form 


where /(J) is an analytical function of the complex variable in a certain 
domain B which includes the point C = fi, and where the integral is 
taken along a closed oontout G which includes S = /x and lies wholly 
within the domain R, then by differentiating both sides of the equation 
I times with respect to fi we obtain the result 


dfjJ' 




Substituting the function (£*— 1)^ for /(C) we obtain, by an application 
of Rodrigues* formula, 

= ( 32 , 
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The associated Legendre functions can be deduced from the table 
of Legendre polynomials at the end of § 6 by means of the former of 
the relations (39). In this way we obtain: 

pKm) = (1-^*)^ 

= 3M1-M*)*. PIW = 

PJW = Flip,) = 16^(1-^*), PKp.) = 16(1-,.*)*, 

PKm) = i(36,i»-16,.)(l-,.*)*, PK,.) = J(106,.»-16){1-,.*), 

FliljL) = 106,.(1-,.*)*, Pii,,) = 106(1-,.*)*, 
PIW = i(316,.*-210,.*+16)(l-,.*)*, 

P*(,.) = i(31V-106,.)(l-,.*), 
i«(,.) = i(946,.*-106)(l-,.*)*, 

Pt(^) = 946,.(1-,.*)*, 

PK,.) = 946(1-,.*)*. 


8. Integrals involving the associated Legendre functions 

1 


If we write Iff = J Pf(,i)Pj;'(,.) dix, 

-1 

then, by equation (39), 


(42) 


(_l)2u-W’ 

2'+^l!l 


r r 


as a result ofV-\-u integrations by parts. Expanding the second factor 
in the integrand by Leibnitz’s theorem we find 

” 2'^^^!^'! Zs!(r+M-8)! 

1 

r d^+u+et 

i -1 ^ 

Now the term of highest order in the expansion of (jn^— 1)« ig so 
1)" vanishes if 5 ^ I'—u. Similarly the second factor 


that • 


in the integrand vanishes if 5 < l—u. Since the integral is symmetrical 
there is no loss of generality involved in the assumption that I < 
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but if Z < Z' the above two relations limiting the values which a is 
capable of assuming are incompatible, so that 


= Z#Z'. (44) 

If Z = l\ then 8 = l—u is the only term of the series (43) which gives 
a non-vanishing contribution to the final result. We then have 


J..U a+«)! f 

a - 2a(i!)2 (i_tt)!(2M)l J 

_ (i+w)! 2 

~(l-u)\2l+l' 


(ja*-iy(2M)!(2?)!£?fi 


(46) 


The analysis for proceeds along similar lines; again we find 

that the integral vanishes unless 1 = 1'. 

By means of the recurrence formula (40) we have 
1 

“1 

SO that, by equation (44), 

1 

j ^iPn|J.)Pjt.(^l)d^i = Q (46) 


unless Z' = Z+1. When Z' = Z±1 the value of the integral follows 
immediately from equation (45). 

Similarly the recurrence relation (41) yields the formula 
1 

J (47) 

-1 


so that this integral also vanishes unless 


Z' = Z±1. 


9. Some expansions involving Legendre functions 
From Schlaefli’s integral formula (32) for the Legendre polynomial 
P^(/i) and the definition (39) of the associated Legendre function, we 
see that when m and n are real integers 




1 (n+m) ! 

27rt 2*^1 




(C»-i)” 


.^)»+m+l 


di, 


(48) 


where the contour G encloses both of the points £ = 1 and J ! 
If /a > 0 we may take the contour C to be the drole 

00 


:/i. 


son 
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Integrating round this contour we obtain from (48) the equation 

27r 

COS I . 


2 ^j ^ sm ^ ^ (ri+m) Ism 

0 

from which follows immediately the Fourier expansion 
O.+V(m*-1)co8(^-0)}« = PM+2 2 ^^^P!?(/t)co8m(^-^) 


(49) 


Changing n to — (n+1) we obtain the expansion 

xm (»-»»)! 


= P>')+2 2 (-1)* 


m-1 


n\ 


(50) 

P^(/i')COSW0, 

(51) 

where, as noted above, we have assumed n to be an integer. 

Applying ParsevaFs theorem for Fourier series to the series (50) and 
(51) we find that the series 

converges to the sum 

J l)cO80}"+‘ 


This integral may be evaluated by means of Cauchy^s theorem.f Its 
valueisfoundtobePJ/xjLt'+>y/{(/i2— 1)(^'2— i)}cos<^]. Writing/ti = cos0, 
= cos 9', and 

COS0 = cos^cos^'+sin^sin^'cos^. 


we obtain the result 

P„(OO80) = P„(CO80)P„(CO8(?') + 2 2 |j^PS‘(co8(?)P«(cose'). 

^ ' (52) 

This result is often of value in the solution of problems in quantum 
mechanics* 

Another expansion which is of use in the relativistic theory of the 
Auger effeotf and of the interaction of two electrons in the self-, 
consistent field method§ is the following: ^ 


= 

ri-r2l h fa 

t E. W. Hobson, Tht Theory of SphericaL and EUipaoidcd Hahnonica, 365-71 (Cam- 
bridge, 1931). 

t H. S. W. Massey and E. H. S. Burhop, Proc. Boy. Soc. A, 153, 661 (1936). 

§ B. Swirles, ibid. 157, 680 (1936). 
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(64) 


where fg is the smaller and the greater of and r, and 
%(x) = ^(\7rx)J„^(z) 

£«(») = V(i««)W+l{*)+(-lN-«-l(*)} 

A similar expansion for the real exponential function can also be found; 

it is , I 00 

= S (66) 


where r^^ and are defined as before. In equations (64) and (66) the 
functions denote the usual Bessel functions in 

the notation adopted by Watson.f 
Partial 'differentiation of equation (66) with respect to k leads to 
other expansions of importance: 


e-*|r,-r,| g, rg)P„(c08I^h,), 


( 66 ) 


where 


and 


Iri-rile'*’'''''*' = T O-PJcosi^Vj), (67) 

T'a) = (68) 


O = (’i+>f)4+j{*»’«)^+j(*»’»)- 

If formulae for higher powers are needed they may be obtained by 
further partial differentiations with respect to k. These expansions 
facilitate the evaluation of certain complicated integrals occurring in 
the theory of molecular structure.^ 


10s Surface spherical harmonics 

From the two sets of orthogonal functions Pf (cob 8) and we can 
form a third set of functions 

YM = (-1)“[^ |^]*Pr(oo8e)c««^(2,r)-*, (60). 

which is an orthogonal set on the surface of the unit sphere, i.e. the 
functions of the set satisfy the integral relation 

^ ir Sir 

J Sm^d^ J* 

0 0 

t 0. N. WtttooQ, The Theory cfBeenl FmeUone <2nd ed., Cambridge, 1944). 

X C. A. Couleoii, Proc. Comb, PhU. Soe, 104 (1987). 

•on oc2 
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The functions possess the further property that they are 

solutions of Laplace’s equation in polar coordinates when djdr is taken 
to vanish identically. Furthermore, they satisfy a large number of 
recurrence relations which may readily be written down from a know- 
ledge of the recurrence relations satisfied by the associated Legendre 
functions Pf(cos0). Some of these results which arise frequently in 
calculations in quantum mechanics are given below for reference. 
Their derivation follows directly from the results of the previous 
sections. 

wW) 


= h{(l-u){l+u+ ^), (63) 

<!>) = ft{(?+«)(j-M+ i)}»y^„_,(0, 4 ), (64) 

= ( 66 ) 

UY^J,e,4) = l{l^lWY^J,d,4). ( 66 ) 
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neutrons by a nucleus, 246. 

Schottky defect, 207. 

— emission, 219. 

BohrOdinger equation, 12, 28, 38, 61, 374. 

Screening constant, 186. 

Second-order radiation processes, 274. 

Selection rules, 257. 

Self-consistent fidd, 119, 138, 214. 

Self-energy, 276. 

Semi-conductors, 224. 

Simple harmonic oscillator, 60, 63, 261> 
367. 


Size of the electron, 277. 

Specific heats, 41, 46, 216. 

Spin coordinates, 108; greater than one- 
half, 363 ; matrices, 97, 298 ; of electron, 
89, 290, 303. 

Spin-orbit coupling, 321. 

Spinors, 351. 

Stationary states, 34 et seq. 

Statistical method of Thomas and Fermi, 
166. 

StaUstics of particles, 107. 

Stem-Gerlach eiqDeriment, 84, 99, 333. 
Surface spherical harmonics, 387. 
Susceptibility, diamagnetic, 141. 
Symmetrical top molecules, 61. 

Symmetry * characteristics of wave 
functions, 106. 

Tetrahedral orbitals, 196. 

Thermionic emission, 218. 

Third-order radiation processes, 274. 
Thomas-Fermi atom, 166. 

Transformation functions, 34, 66. 
Transition probability, 247. 

Transitions to unquantized states, 249. 
Transmission coefficient, 16. 

Transparency of a barrier, 16, 22. 
Transverse self-energy, 276. 

Trapping of electrons, 227, 230. 

Triangular barrier, 18. 

Trigonal orbitals, 196. 

Tunnel effect, 16. 

Two-body problem, 102 et seq. 
Two-quanta annihilation, 275, 315. 

Uncertainty principle, 30. 

Unitary matrix, 368. 

Vacant lattice points, 206, 206. 

Valence forces, 176. 

Van der Waals’ forces, 164 et seq., 204. 
Vibration of a molecule, 162. 

Vibrational states of diatomic molecules, 
46, 103. 

Water molecule, 193. 

Wave functions, 106; for many-electron 
atoms, 127 ; for the spin, 92. 

Wave velocity, 24. 

WEB method, 19. 

Width of spectral lines, 268. 

Wigner’s perturbation method, 81. 

Zeeman effect, 86, 89, 91. 
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